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COMPLETE DIFFERENCE IDEALS.* 
By J. F. Rirt. 


If the manifold Yt of an ideal ¥ of differential polynomials is composed 
bof two manifolds Yt, and Mt, with no solution in common, & has a unique 
Irepresentation as the product of two ideals whose manifolds are respectively 
/M, and Mt..1 We show here that no result of this degree of generality holds 
I for difference polynomials,? and then direct our attention to a natural class 
of difference ideals, the complete ideals, for which a theorem on separated 
| manifolds can be derived. While we are chiefly interested here in difference 
q equations, our decomposition theorem, Theorem II of §10, is distinctly 
"Noetherian in form, so general, even, as to unite the theory of algebraic 
F difference equations with the theory of numbers. 


Difference Ideals and their Products. 


1. We use a difference ring @ as in R. R., $1. In defining difference 
ideal in R. R., § 3, it was stipulated that the presence in a difference ideal x 
of the transform a, of an element a must imply the presence in w of a. This 
| restriction upon the nature of a difference ideal, made to insure that every 
prime ideal be perfect, is a potential source of inelegance for the investigation 
which we are about to undertake. In meeting this situation, our first step 
will be to call the difference ideals of R. R. reflexive difference ideals. 

3 In the present paper, an ideal + (in the sense of algebra) contained in & 
| will be called a difference ideal if the presence in + of the element a of & 
| implies the presence in z of the transform of a. A difference ideal x can be 
extended into a reflexive difference ideal by the adjunction to z of every 


/ element which has a transform of some order in 7; one obtains thus the 


“intersection of all reflexive ideals containing 7. 


* Received February 17, 1941. 

1Ritt, Proceedings of the National Academy of Sciences of the U. 8. A., vol. 25 
p (1939), p. 90; Ritt and Kolchin, Bulletin of the American Mathematical Society, 
' vol. 45 (1939), p. 895. 
3 * Difference ideals are treated by Ritt and Raudenbush, Transactions of the 
| American Mathematical Society, vol. 46 (1939), p. 445. This paper will be denoted 
by R. R. We would mention here that minor modifications of the argument in R. R., 
$§ 11,12 lead to the result that every infinite system = of difference polynomials can 
| be derived from any of its bases with a finite number of shufflings. 
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A difference ideal x will be called perfect if whenever a is such that some 
product of integral powers of transforms of a is in z, a is also contained in z. 
A perfect ideal as thus defined is reflexive; hence the perfect ideals of the 
present paper are the same ideals which were called perfect in R. R. 

A difference ideal x will be called prime if x is a prime ideal in the sense 
of algebra; a prime ideal is perfect only if it is reflexive. 

For a ring @ with a basis theorem, the theorem of R. R., § 8 now becomes: 
Every perfect ideal in ® has a unique representation as the intersection of a 
finite number of prime ideals none of which contains any other. In this repre- 
sentation, the prime ideals are perfect. 


2. Given a set o of elements of ®, we denote by [o] the difference 
ideal * generated by o, that is, the intersection of all ideals containing o. An 
element a is in [o] if and only if a@ is a linear combination, with coefficients 
in ®, of elements of o and their transforms of various orders. We denote by 
{a} the perfect ideal generated by o. 

The product or of two ideals o and r will be defined as the ideal generated 
by the totality of elements ab where a is any element in o and Bb any in +. 
The product, as thus defined, of o and + is identical with their product in the 
usual sense of algebra; this results from the fact that the transform of the 
product of two elements is the product of their transforms. 

Let o and + be ideals. We shall prove that 


(1) {or} = {o} * {7}. 
The second member of (1) contains the first member. We show that the first 
member contains the second, thereby establishing not only (1), but also the 
relation 
(2) {oor} = {o} 9 {7}. 

In the lemma which follows, the meaning of on, for any set a, will be as 
in R. R., § 5.4 


Lemma. Leto andr be ideals. For n=1, if aeon and betn, abe (or) 


First, let m1. Some product ¢ of transforms® of a is in o Some 


product d of transforms of 6 is in +r. Then cdeor. Some product of trans- 


°In the remainder of this paper, ideal will mean difference ideal. 

* One will notice that square brackets as used there have a meaning identical with 
that established here. 

5 The parentheses about o7 are merely symbols of inclusion. 

® More fully, a product of powers of a and its transforms. 
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forms of ab is a multiple of cd. Then abe (or),. Now let n= 2. There are 
ac and a d, related as above to a and to b, with expressions 


where the w are in o, and the v in 7,. Now cd is a sum of terms kujv;. By 
the case of n = 1, each of these terms is in (or);. This puts ab in (or)>. 
The proof continues by induction. 

We now complete the proof of (1). Let a be in {o} and in {r}. There 
is an n such that a is in o» and in t,. By the lemma, a? is in (o7)» so that 
ais in (or), and thus in {or}. 

We have, immediately, the theorem : 


THEOREM I. For any ideals o1,° + +, 08, 


° = {o,%: : = {a,} {os}. 


Abstract Manifolds. 


3. It is well to make clear at this point the notion of the manifold of a 
system of difference polynomials (forms). Let F be a difference field and > 
a system of forms in the unknowns 4,,°° -, Yn with coefficients in F¥. A dif- 
ference field F’ is called an extension of F if F contains F¥. Here it is 
understood that if a is an element of ¥ and if a’ is the transform of a before 
# is absorbed into #’, a’ remains the transform of a after the absorption. 
If an extension of § contains a set of n elements a,,° °° ,@, which are such 
that the substitution for each y; and its transforms, in the forms of %, of aj 
and its corresponding transforms reduces each form of = to 0, we shall call 
@,,°* *,@ a solution of &. The totality of solutions of 3, obtained from all 
extensions of F in which solutions exist, will be called the manifold of %.’ 

A situation is met in connection with difference equations which is 
anomalous insofar as it does not occur in the theory of algebraic equations. 
It is possible for each of two systems 3, and 3, with coefficients in a single 
difference field F to have a non-vacuous manifold and still for no extension of 
FY to exist in which 3, and 3%. both have solutions.’ Let ¥ consist of all 
rational numbers. We use forms in the single unknown y. Let 3, consist of 


7 Here we follow a standard procedure of algebra, inaugurated by van der Waerden, 
Vague in nature as the totality of solutions may be, considerations of language justify 
its toleration; in any particular situation, there is no vagueness. 

SIt is to be emphasized that we are talking of solutions from the standpoint of 


algebra and not from that of analysis. 
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y? +1 and y,—y;3° & of and y,+y. has the solution y= (—1)! 
in the field of complex numbers, each number its own transform; 32 has the 
solution y = (—1)! in the field of complex numbers with the transform of 
each number taken as the conjugate of the number. Suppose that there is 
an F’ in which 3, and &. both have solutions. Let a be a solution of 3%, in 
F’. Then —a is the only other solution of 3, in ¥’. Now a and —a are the 
only possibilities for solutions of 3. in FY’. However, as they annul y, —y 


and are not zero, they do not annul y, + y. 


Separated Ideals. 


4. Two ideals o and 7+ in #& will be said to be separated if the perfect 
ideal generated by the logical sum of o and 7+ contains unity; that is, in 
symbols, if 1¢{o,7}. If o and + are ideals of forms, they are separated if 
and only if they have no common solution.*® 

Two ideals o and + will be said to be strongly separated if elements a 
and b, belonging respectively to o and to 7, exist such that a+ b —1. 

We give an example of two perfect ideals of forms in y which are sepa- 
rated but not strongly separated. We use the field of rational numbers. 

Let 

A=1+yy; B=y+ty; {A,B}. 
As 
1— = A— wB, 


contains 1—y’*. Let C= (1—y)(1+ 4). The product of C by its 
transform is divisible by the transform of 1—y*. Hence Cex. If we replace 
y, in C by B—y, we find that (1— y)*«X. The substitution y= B—y 
into C shows that (1+ 4,)*«3. Then (1+ y)?e%. Thus & contains 1—y 
and 1+ y and hence contains unity. Hence {A} and {B} are separated. 

Now if A=0O and BO are regarded as difference equations in the 
analytic function y of the variable z, B = 0 is satisfied by and A 0 
is satisfied by — ie9'*) where g(a) = (7i/2)e™**. Each of these analytic func- 
tions assumes the value 1 for even integral values of z and the value —1 for 
odd values of z. Thus all forms in {A} and in {B} vanish when each y; with 
j even is replaced by 1 and each y; with 7 odd by —1. Hence {A} and {B} 
are not strongly separated. 

The strong separation of two ideals of forms 3, and 3, thus involves the 


°y, =y(@+1). 
10 This follows from the Nullstellensatz of R. R., § 13. 
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incompatibility of the systems 3,0 and %,—0 considered as systems of 
algebraic equations in an infinite set of unknowns. 

Strongly separated ideals are met in connection with algebraically irre- 
ducible forms whose manifolds are reducible. Thus, let 


A= +1. 
Then 
(3) {A} = {A, B} « {A, C} 


where 


B=y,—y, 


Unity is a linear combination, with constant coefficients, of B, C and their 
first transforms. Thus the ideals in the second member of (3) are strongly 


separated. 


5. If o and 7 are strongly separated ideals, or =o%+r. For the proof 
let a+ with a ing and intr. Let ceo*r. As c=ca+cb, ceor. 


If o,,°°*,0, are ideals with o, strongly separated from each of 
0s, then is strongly separated from os. For, let aj +b; =1, 
2,---,s, with a; in o, and bj ino; Then 

-b, = (1 — a2) (1—a,) 
and here b.--: bs €o2°++os While ceo;. A fortiori, o, is strongly separated 
from 

6. <A set of ideals o;,:--,o, will be said to be strongly separated in pairs 


if each o; is strongly separated from each o; with 741. For such ideals, by § 5, 


An Indecomposable Ideal. 


7. We use the field of rational numbers. Let A =y?—1, 3 = [A]. 
The forms of & are of the type 


(4) MA+M’A,+---+MMA, 


where A; is the j-th transform of A. 
The manifold of = consists of 1 and —1. We proceed to show that 3 
has no representation either as a product or as an intersection of two ideals 
whose manifolds consist respectively of 1 and —1. 
Let 3, and &. be ideals with the respective manifolds 1 and —1. Then 
=, contains a form B which does not vanish when each y; is replaced by —1 
and &, contains a form @ which does not vanish when each y; is replaced by 1. 
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If & is sufficiently large, the k-th transform C; of C will contain no y; which 
appears in B; we choose such a k. Then BC;, fails to vanish when the y; in B 
are replaced by —1 and those in Cy; by 1. Now if & were the product or the 
intersection of S, and &., & would contain BC, and BC, would be of type (4). 
Then BC; would have to vanish when the y; are replaced in any manner, 
independently of one another, by 1 or —1. This proves the non-existence, 
for 3, of the described representations. 

The system of equations } = 0, considered as an algebraic system in an 
infinite set of unknowns y;, has, in addition to the solutions yj =1 and 
yj =—1, j=0,1,---, an abundance of other solutions, which cannot be 
used for the construction of analytic solutions of the system & of difference 
polynomials. One may, on this basis, properly blame the algebraic system for 
the indecomposability of 3. The thought now arrives that an ideal A will be 
decomposable if A contains some power of every form in {A}. The restriction 
suggested here is actually too strong. It suffices that A contain, fer every form 
G in {A}, some power of some transform of G. Speaking in the ignorance 
which exists with respect to the analytic solutions of non-linear difference 
equations, one might make the vague statement that it is permissible for the 
algebraic system A = 0 to have some, but not too many, solutions which do 


not yield analytic solutions of A. 


Complete Ideals. 

8. An ideal o in @ will be called complete if, for every element a in {c}, 
there exist a positive integer p and a non-negative integer j, each depending 
on a, such that a; being the j-th transform of a, aj? eo. 

We proceed to prove that, for an ideal o to be complete, it is necessary 
and sufficient that the presence in o of a product of transforms of an element 
should imply the presence in o of a power of a transform of the element. 

The necessity is obvious. For the sufficiency, we consider an ideal o which 
satisfies the condition, and use the sets on of $2. If aeo,, some product of 
transforms of @ is in oa, so that some power of some transform of a@ is in o. 
Let aeo,. There is a product b of transforms of a such that 


(5) b=hu+---+kv 
with u,---,v ino, For every j and p 


11 Henceforth, in this paper, transforms will be indicated by subscripts. 
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By the case of n = 1, bj? eo if j and p are both large. Then some power of 
some transform of a is ino. Continuing, we find « to be complete. 
The intersection of any finite set of complete ideals is complete. 


9. An ideal o will be called mixed if the presence in o of ab implies 
the presence in o of ab,. A mixed ideal satisfies the italicized condition of § 8. 
Hence, every mixed ideal is complete. 

The intersection of any set, finite or infinite, of mixed ideals, is mixed. 


10. We prove the theorem: 


TuHeEorEM IJ. Let o be a complete ideal. Suppose that {o} is the inter- 
section of s perfect ideals 7,,- + -+,7: which are strongly separated in pairs. 
Then o has one and only one representation as the intersection of s complete 
ideals with {oi} =7i, In this representation, 
° are strongly separated in pairs. 

By § 6, the intersection of the o; equals their product. 

11. We let += {oc} and consider first the case of s = 2. 

Two elements will be said to be congruent, modulo an ideal, if their 
difference is in the ideal. 

Let e + f =1 with e in tr, and f in 7. Then efez so that some (e;f;)? 
is ino. We write 
(6) es +fj=1 
and raise both sides of (6) to the (2p—1)-th power. In the first member 
of the resulting equation, let ¢ represent the sum of those terms in which the 
exponent of e; is not less than p and let d represent the sum of the remaining 


terms. Then ce7r,, der, and 


(7) c+d=1 
while 
(8) cd =0, [o].” 


By (8), eo so that (cd,)(¢cidz) Thus cd,er. Similarly c,d er. 
Thus, for some j and p, 

(9) c? ;d;,, = 0, [o] ; c?;,,d?; =0, [oc]. 

Putting d= 1—c in (8), we find that c= c?,[o]. This implies that 
c= c1,[o] for every g. Similar congruences hold for d and for the transforms 
ofcandd. Thus, by (9), 

(10) = =0, [o]. 


12 Of course, [c] 


= 


J. F. BRITT. 


We now write a=cj,b=—d;. Then aer, ber, and 


(11) a+b=1, 
while 
(12) ab = ab, =a,b =0, [ce]. 


12. We show now that 


(13) a, —a=b,—b=0, [c]. 
By (12) 
(14) a(b —b,) =0, [ce]. 


In (14), we put a—=1—b and b—b,—a,—a. We find that 


(15) (1— b) (a, —a) =0, [o]. 
By (12), 
(16) a, —a=0, [oc]. 


Similarly, 6, —b=0, [o]. 


13. We now let o, =[o,a] and o.—[o,b]. We wish to prove that 
{o,} —7,. It is only necessary to show that 7, is contained in {o,}. Let q 
be any element in 7;. Then bg is in +, hence in {o,}. Now bg = g —ag. 
Because ag is in oi, g is in {o,}. Similarly {o.} = 7. 

By (11), o; and o2 are strongly separated. 


14, Let us prove that o, is complete. If g is in 7;, gb is in +r. Hence 
some gj;?bj? is ino. As bj =1—day, and is in gj? is in o;. In the 


same way, o2 is complete. 


15. We prove now that 


(17) Oo. 

It is only necessary to show that the first member of (17) contains the second. 
Let geo,%o2. Because 

(18) 

with e ino. By (16), g has an expression 

(19) g=h-+ka 

with h ino. ‘Then bgeo. Similarly ag eo, so that g, which equals (a + b)g, 

is in o. 


16. We treat now the question of uniqueness. Let o be the intersection 


of complete ideals o’; and o’, with r; = {0';}, t2 = {o'2}. Let geo’;. Some bj? 
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is in Thus 0j?g is in and hence in o;. As 6; =~ 1—a,; and qa; is in 
g is in o,. Again, let heo,. Then for every j, hb; =h(1—ay;) is in o and 
hence in o’;. It follows that h(1— aj‘) is in o’, for every j and qg. For j and 
q large, aj%¢o’;. Thus heo’;. We have proved that o, and o’; are identical ; 
so also are o2 and 


This settles the case of s = 2. 


17. We treat the case of a general s. Let the theorem be true for 


sSr—1, where r>2. For s—r, if we let €=7.%- --%7,, we have 


IA 


By § 5,7, and are strongly separated. Hence with 
and ¢ complete and with 7; = {01}, = {¢}. From the case of r—1, we 


know that £=o.%- --%o, with each o; complete and 7; = {oi}. Then 
o=0,%02°° Because o, and are strongly separated and is con- 
tained in each o; with 1 > 1, o; is strongly separated from each of o2,° ++, or. 
In other words, o;,02,° * -,or are strongly separated in pairs. 

For the uniqueness, let o have a second representation = 0’, %- “o'r. 
By Theorem I, {o’.%---*0’,} =é€ where é is as above. This, by the case of 
s = 2, means that o’; =o, and that o’.%- --%o’,, which is complete, equals ¢. 
By the case of s =r—1, 1=2,° 


18. Let us prove that if «, in Theorem II, is mixed, each o; is mixed. 
We consider o,. Let gheo,. We use ¢ as in $17. Let a+b—1 where 
aeo,, bef. Then ghbeo, so that ghibeo. As b =1—aand gh, is in 
gh, is in oy. Similarly, o2,- - -,os are mixed. 

Again if o is reflexive, every o; is reflexive. Let some transform c; of an 
element c be in o;. Then, for b as above, cjb; eo so that cb is in o and thus 


in o;. As.above, ceo,. Similarly, o2,- are reflexive. 


19. We note that the elements of each o; in Theorem II are of the type 
e+ fa. with e in o and a“) a fixed element of 7;. 


Construction of Complete Ideals. 


20. Given a set o of elements of ®, there are complete ideals, for 
instance {o}, which contain «. There is not always a smallest complete ideal 
containing o; that is, the intersection of all complete ideals containing o may 
not be complete. 

For an example, we consider forms in y over the field of rational numbers. 


690 J. F. RITT. 
Let > = [yy]. Then %, whose forms are linear in transforms of yy,, is not 
complete.** For any positive integer p, let 
= Ly’, yy]. 

>» is complete because {Xp} = {y} and every linear combination of transforms 

of y’ is in &». The intersection of all 3), p=1,2,---, is [yy:]. To see this, 
let, for some p, A be a form in 3p which is not in [yy,]. Then A=B+C 
with B in [yy,] and C in [y*]. Transferring terms of C to B if necessary, o 
we shall suppose that no term in C is in [yy,]. Let q be an integer which cl 
exceeds the total degree of C in the yj. If A were in 3g, C would be in 3. i 
Writing C as a sum of homogeneous polynomials H;, we would find each H; . 
to be in Sq and therefore in [yy,]. This is impossible. There is thus no ; 
smallest complete ideal containing [ yy, ]. 
Given any set o of elements of #, the intersection of all mixed ideals ? 
which contain o is a mixed ideal. Z 
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DECOMPOSITIONS OF COMPACT METRIC SPACES.* 


By R. L. WILper. 


Let M be a compact metric space and y a local topological property. 
If y fails to hold at re M, we call x a y-singular point of M. Let T denote a 
class of compact metric spaces having the property that if MeT and if S§ is 
the set of y-singular points of M, then S is either vacuous or contains non- 
degenerate connected sets; we then call y expansive relative to YT. 

We commence with a property y that is expansive relative to a class T 
of compact metric spaces, and define [4, p. 224] for each MeT a new dis- 
tance function by means of which there is generated a continuous single-valued 
mapping of M into a new space M’: For z,yeM, consider all possible 
“-e-chains” t) (=72), (=y), where S =set of 
y-singular points of MW) and p(ti, tis.) Ce (OSt1Sn—1). The greatest 
lower bound of numbers e for which such e-chains exist we call the new dis- 
tance p'(r,y). Since p’(z,y) Sp(z,y), the mapping so generated is con- 
tinuous. If we call each point of 4 —S a y-prime part and each component 
of § a y-prime part, then the counter images of the points of M’ are the 
y-prime parts of , and we therefore call M’ the space of y-prime parts of M. 


THEOREM 1. If the local topological property y is expansive relative to 
a class T of compact metric spaces and M eT, then the space M’ of y-prime 
parts of M, if itself an element of T, has no y-singular points.’ 


Proof. Suppose the set S’ of y-singular points of M’ is non-vacuous. 
Since M’ eT and y is expansive relative to T, S’ contains a non-degenerate 
component (’.2 The counter-image C of C’ is a subcontinuum of M [4, p. 224]. 
But we may show that C CS, where S is the set of y-singular points of M 
and that the image of C in M’ is therefore a single point. For suppose there 
exists re C, Then there exists an « such that S- S(z,¢«) =0. By defi- 
nition of M’, S(a,e«) and S’(z,¢) are topologically equivalent and conse- 
quently S(2’,«) contains no y-singular points, contradicting the fact that 


* Received June 2, 1941; Presented to the American Mathematical Society Sep- 
tember 7, 1937. 
14 special case of this theorem, namely where I is the set of all compact metric 


spaces, has been stated by Whyburn [9]. 


* The argument given by Whyburn for his theorem [9] may be used from here on. 
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Applications of Theorem 1, for the case where I is the class of all compact 
metric continua, have already been pointed out by Whyburn [9]. In particular 
(for the present paper the most interesting case), if y is the property of local 
connectedness (0 —J/c) and T is the class of all compact metric continua, then 
we get the well-known theorem of Moore [7] that the space of “ prime parts” 
of a compact metric continuum is peanian. An elementary application to the 
class T' of all compact metric spaces is that where y denotes 0-dimensionality; 
we then get the theorem of Alexandroff [1] that the set of components of a 
compact metric space is itself a 0-dimensional space (whose metric may be 
taken as the distance between components). 

Now it is to be noted that the 0 —Ic property is not expansive relative 
to the class of all compact metric spaces (as for instance in the space con- 
stituted by the cartesian points (0,0) and (1/n,0), n —1, 2, 3,---), but is 
expansive relative to the class of compact metric spaces whose 0-dimensional 
Betti numbers are finite. By making the convention that an Ic" is simply a 
compact metric space with no local connectedness* properties assigned, the 
fact just stated comes under the case n = —1 of the following theorem: 


THEOREM 2. The property Wns of being (n+1)—le is expansive 
relative to the class C,"*! of compact metric spaces that are Ic" and have finite 
(n + 1)-dimensional Betti numbers. Moreover, every Wnss-singular point of 
an element M of C,"*! ts in a non-degenerate component of the closure of the 
set of Wnii-singular points of M. 


Proof. Let MeC,"** and pe M bea point at which is not (n+ 1)—le. 
By Lemma 3”* of [11], there exists an e >0 such that if U, V, W are 
neighborhoods of p satisfying the relations S(p,«) DU DV, VD W, then 
on the boundary of V there are infinitely many (n +4- 1)-cycles independent 
relative to homologies in U — W. 

Denoting by S the set of Yn,:-singular points of M, suppose that the 
component of S determined by p is p itself. Then there exists a neighborhood 
V of p such that S(p,«) © V and the boundary, F’, of V contains no points 
of S. We may cover F by a finite set of spherical neighborhoods whose closures 
lie in S(p,e) and do not meet S. Denote the open set formed by the sum of 
these neighborhoods by O, and let VU =V+0,W=V—VTV- O. It follows 
that F' is a compact subset of the locally compact metric space U — W, the 


* Throughout we use Vietoris cycles usually with a finite or compact topological 
coefficient group, and local connectedness in the homology sense: For example, a com- 
pact set is locally n-connected (n—JIc) if given e>0 there exists 5>0 such that a cycle 
of diameter <6 bounds on a set of diameter <e. A set is called len if it is i— le for 
i=0,1,---,n. These properties are localized in the usual way. 
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latter being lc"**. Consequently by Theorem 2 of [11], at most a finite number 
of (n-+1)-cycles of F are independent relative to homologies in U — W. 
But this contradicts the definition of e above. 

With the same meanings for C,,"** and wn,, as in Theorem 2, we may 
state the following theorem: 


THEOREM 3. Jf MeC,"* and the Wns-prime parts of M are simply 
i-connected* for 1=1,2,---+,n, then the space M’ of Wnis-prime parts of M 
1s Ic"*?, 


Proof. By Theorem 2, Wn. is expansive relative to the class C,"**. We 
shall show that M’ « C,,"*1 whereupon the conclusion of the theorem follows 
from Theorems 1 and 2. 

That p"**(M’) is finite follows from a theorem of Vietoris [8] and the 
fact that p"**(M) is finite. 

Consider p’« M’ and e > 0. Denote the mapping of M into M’ defined 
in the introduction by f. Then U —f"[S(p’,e)] is an open subset of M 
containing the set P=f'(p’). Let 0<8<e and V=f[S(p’,8)]; 
obviously V is a closed subset of M such that U ~ V. 

In the mapping S(p’,8) =f(V), the counter images of points are all 
simply i-connected for i= 0,1,---,n. Consequently [8] to every cycle y* 
of S(p’,8) there corresponds in S(p’,8) a cycle z such that y'~z* in 
§(p’,8) and z‘ is the mapping of a cycle T‘ of V. By Theorem 2 of [11] 
only a finite number of the i-cycles of V are independent relative to homo- 
logies in U, and as the mapping f extends to f(U) = S(p,e), it follows that 
only a finite number of cycles of S(p’, 8) are independent relative to homologies 
in S(p’,«) and hence M’ is i— lc at p’. 

Corottary. Jf M ts a Peano space such that p'(M) 1s finite, then the 
space of y,-prime parts of M is an Ic’. 

Now as a decomposition theorem, Theorem 3 is of somewhat restricted 
application when n is large. Suppose, however, we consider the property Yo" 
of being Ic", so that a Y"-singular point is one at which the space fails to be 
i—lIc for some 1 = n. 

THEOREM 4. The property Yo" of being Ic™ (On) is expansive rela- 
tive to the class C_." of compact metric spaces that have finite t-dimenstonal 


Betti numbers for alli Sn. 


Proof. Let MeC_.". If M fails to be 0 —Ic, then the closure of the 


*i.e., all the i-cycles of each prime part bound thereon. 
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set of points at which WV is not 0 —/c contains a continuum; and a fortiori 
the set of y"-singular points contains a continuum. In general, if M fails to 
have property yo", let & be the largest integer for which M is Ic*. Since 
k <n, p**(M) is finite by hypothesis and Me C,**. Hence by Theorem 2 
the closure of the set of points at which M is not (k + 1) —lIc contains a 
continuum and a fortiori the same is true of the set of ¥o"-singular points of M, 


~ 


THEOREM 5. Jf Me C_." and the yo"-prime parts of M are simply 
i-connected for 1=1,2,---,n—1, then the space M’ of yo"-prime parts of 
M ts an Ic”. 


Proof. Since by Theorem 4 yo" is expansive relative to C_.”, we need, 
in order to apply Theorem 1, only show that under the given conditions the 
space M’ « C_,". The latter follows immediately from results due to Vietoris [8]. 


CoroLuary. If Me C_,', then the space of yo'-prime parts of M is an 


We are now in a position to state our principal theorem. Hereafter if r 
and n are integers, we denote by C,” the class of all compact metric spaces M 
such that M is lc" and the Betti numbers p‘(M) are finite for r< in. 


And by we denote the property of being i—I/c fori =r, r+ 1,---,n. 


PRINCIPAL THEOREM. Let r and n be integers such that —2<r<n 
Then the property Wr" is expansive relative lo the class of spaces Cy". If 
M eC," and the Wr.."-prime parts of M are simply i-connected for 1 = 1, 2, 


n —1, then the space of Wr.."-prime parts of M is an le". 


That y,..” is expansive relative to C,”" is proved as in Theorem 4, starting 
inductively with the result of Theorem 4 (r==-—1). To prove the remainder 
of the theorem we may apply the results of Vietoris on continuous mappings 
[8] as in the proofs of Theorems 3 and 5 to show that under the given con- 
ditions the space M’ of y,,,"-prime parts of M is an element of C,"; then M’ 
is an lc” by Theorem 1. 


SomE APPLICATIONS 


In the article of Hahn [3] in which what we call the y.°-prime parts were 
introduced, it was shown that a continuum which is irreducible between two 
of its points and has more than one prime part is a simple arc of prime parts. 
In R. L. Moore’s investigations, besides the results already cited above, some 
attention was paid to the case where the space of prime parts is a simple closed 
curve [6]. It would be of interest to determine under what conditions the 
y,"-prime parts of a space constitute spaces falling among the more well-known 
topological configurations of higher dimensions. The following two theorems 


illustrate this type of application. 
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THEOREM 6. Let M bea space such that p*(M) > 0 irreducibly, p'(M) 
as finite, and the y'-prime parts of M are simply 1-connected and none forms 
a 1-barrier” Then the space of yo'-prime parts of M is either a closed 


2-dimensional manifold or a point. 


Proof. I have shown in unpublished work that if a compact metric space 
M’ has p*(M’) > 0 irreducibly, and is both semi-1-connected [11, p. 549] and 
locally 1-avoidable [12], then M’ is a closed 2-dimensional manifold. We shall 
show that the space M’ of y.'-prime parts of M, if not a single point, has these 
properties. 

By the Corollary of Theorem 5, M’ is an Ic’, hence a fortiori semi- 
1-connected. As the y'-prime parts are simply 1-connected by hypothesis, 
and simply 2-connected when M has more than one such prime part, it fol- 
lows [8] that p?(M’) = p?(M) > 0. Suppose M’ has a proper subset F” 
such that p*(F”) > 0. Let y* be a non-bounding cycle of F”. Then [8] there 
exists on F” a cycle 2 such that y?>~ 2? on F” and f*(z2*) is a cycle of 
F =f"(F’) (where f is the mapping of M into M’). But F is a proper sub- 
set of M and consequently f?(z?) ~9 on F. But then z?~0 on F& and 
y?>~0on F’. Thus p?(M’) > 0 irreducibly. 

Consider any p’ eM’, « > 0, and let 0<4<8< such that 1-cycles of 
F(p’,8) bound in S(p’,e). As M’ is Ic’ at most a finite number of 1-cycles 
of F(p’,8) fail to bound in S(p’,e) —S(p’,n). Let y' be such a cycle. Let 
p=f1(p’) and F =f"[F(p’,8)]. There exists on F'(p’,8) a cycle 2* such 
that ~~ on F(p’,8) and = f(z") is a cycle of F [8]. As y1~0.in 
S(p’,e). ~0 in S(p’,e), hence 0 on M [8]. As ‘p is not a barrier of 
then on some closed subset K of M—p. But then 
on f(K) C M’ — p’, implying 0 on M’— yp’. It follows that can be 
so chosen that for every y! of F(p’,8), yi} ~0 on M’—S(p’,n) and M’ is 
locally 1-avoidable. This completes the proof. 

For the case n > 2 we have the following theorem: 


THEOREM 7. Let MeC", p"(M) =1 and p"(F) =0 tf F 1s a proper 
subset of M. Then if the Wo""'-prime parts of M are simply i-connected for 
i==1,2,---,n—1, and none is an i-dimensional local barrier for 2,°*-, 
n—2 nor an (n—1)-barrier of M, then the space M’ of such prime parts 1s 


either a generalized closed n-manifold [10] or a point. 


Proof. Suppose ¥ is non-degenerate. Then the y%o"-'-prime parts of M 


‘i.e. if P is a y,'-prime part and 21 is a 1-cycle which lies on M— P and bounds 


on M, then 2! bounds on M— P. 
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are simply i-connected for all i= n and p"(M’) =1 [8]. And if F” is a 


proper subset of M’, p"(F”’) = p"[f"(F’)] =0. 

That M’ is Ic" follows from Theorem 5. Hence if p’e M’,0< n<b<e, 
the number of 1-cycles (i = n —1) of F(p’,8) independent relative to homo- 
logies in S(p’,«) —S(p’,y) is finite [11]. The local (n — 1)-avoidability of 
M’ now follows as in the proof of the local 1-avoidability in the preceding 
theorem. 

Consider the case 1 [i= n—2. For any e>0 the set VU =f"[S(p’, )] 
is an open subset of M containing P = f'(p’). By hypothesis there exists a 
6; >0 such that every i-cycle of S(P,8,) —P bounds on U —P, where 
S(P,8,) CU. By continuity considerations there exists a 8 such that 
V =f"[8(p’,8)] C S8(P, 8). Consider any cycle of F(p’,8). It is 
homologous on F(p’,8) to a cycle z‘ such that f-'(z‘) is a cycle T* of 
f?[F(7’,8)]. As T* bounds on U—P, z bounds on S(p’,e) —p’ and 
hence y* bounds on S(p’,e«) —p’. From the observation stated in the second 
sentence of the preceding paragraph it now follows that there exists an 7 
such that 0<7/ <& and every i-cycle of F(p’,&) bounds on S(j’,e) 
— 8(p’,7’). Thus M is a generalized closed n-manifold. 

These theorems have applications in turn to subsets of the euclidean 
n-sphere S". For example, a set M satisfying the hypotheses of Theorem 6 
and lying in S* will, in case its y'-singular points are not dense in it, have 
exactly two complementary domains and be their common boundary. (In this 
connection see [5,6]). 

Evidently a theorem similar to Theorem 7 is derivable from the Principal 
Theorem; i.e., Theorem 7 remains true if we replace C™} by Cy" and yo" 
by 

In closing we remark that in a paper published in 1930 [2] Aronszajn 
has determined conditions on the class of sets which serve to define a given 
local topological property y, that are sufficient to ensure that every point of 
the set S of y-singular points of a compact metric continuum should lie in a 
non-degenerate continuum of S. Since application of Theorem 1 above 
requires only the existence of at least one non-degenerate component in the 
closure of S, it might be of interest to investigate general conditions sufficient 
to ensure expansiveness as we have defined it. It is clear from the above results 
that in any such investigation the problem is one of adjusting the conditions 
on the class of sets defining the topological property to the class of spaces 
under consideration (the latter were continua in the Aronszajn investigation). 
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ON FOURIER AVERAGES.* 


By AUREL WINTNER. 


If g(t),—_ <t< oo, is integrable on every finite t¢-interval, let 
M:{g(t)} denote the average 


T 
M:{g(t)} =lim arf. g(t) dt, 
aT -T 
provided that this limit exists. 
If g(t) =0, put 


(even if Mi{g(t)} 0). 
If, for a given f(t), 0 <t< o, the average 


Mi{f(t +s)f(t)} 


exists for — oo <s< co and represents a continuous function of s, then 
f(t) is said to possess an auto-correlation. 
For a fixed real A, the A-th Fourier average of an f(t), —-w <t< 9%, 
is defined by 
c(A) = Mile f(t) }, 


provided that this average exists. 

It was proved by Karamata’ that, if M+{| f(t)|*} < oo and if 
exists for every A, then the function | c(A)|?, — 0 <A< o, is of bounded 
variation and vanishes nearly everywhere (that is, except for a set of A-values 
which is enumerable, at most). Karamata’s proof of his theorem is quite 
elaborate. The following, more general, theorem is an immediate consequence 
of an adaptation of the classical argument of Bessel: 


(I) If the upper mean square, M;{| f(t)|?}, of @ measurable function 
f(t),—_ 0 <t< o, 1s finite, then the set of those values X for which the 
Fourier average, c(A), of f(t) exists but does not vanish is enumerable, at 
most; furthermore, tf A1,°**,An ts any set of A-values for which c(Xr) exists, 
then 


* Received February 26, 1941. 
1 J. Karamata, “ Sur la moyenne arithmétique des coéfficients d’une série de Taylor,” 
Mathematica (Cluj), vol. 1 (1929), pp. 99-106. 
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(1) f(t) |?} —3 | C(Am) |? = 0. 
In fact, since 
(2) f(t) c(Am)e* |?} 
m=1 


is non-negative, it is sufficient to prove that the expression (2) is identical 
with the difference (1), if c(A) exists for -,An. Furthermore, it 
may be assumed in the proof of this identity that f(t) is real-valued. Then, 
if c(A) exists, c(—A) exists also, it being the complex conjugate of the 
number c(A). 

Since M:{| f(t)|?} << © is a constant, and since c(+ Am) exists for 
m==1,---,n, the absolute square beneath the M;{--- }-sign of (2) is the 
sum of (n+ 1)? terms, (n-++-1)?—1 of which possess averages Mr{ ---} 
(and not merely finite upper averages M;{---}); while the (n+ 1)*-th term 
has the upper average M+{| f(t) |*}. Since this implies that the expression (2) 
is identical with the difference (1), the proof of (I) is complete. 

In view of (I), it is natural to ask whether an unspecified measurable 
f(t) can or cannot possess a non-enumerable set of non-vanishing Fourier 
averages c(X), if either the assumption, {| f(t)|?} < 0, of (I) is com- 
pletely omitted or is replaced by the weaker assumption M;{| f(t)|} < ©. 
The answer is not known to be affirmative; not even under the additional 


assumption that all the Fourier averages c(A) of f(t) exist.? 

The following considerations deal with the question of existence of the 
Fourier averages. More specifically, a sufficient condition will be obtained for 
those functions f(t) for which the Fourier average c(A) exists for every A 
not contained in a A-set of measure zero. The method to be applied consists 
of an adaptation of a procedure by means of which certain considerations of 
Riemann were legalized recently.* The excluded zero set will not, in general, 
be enumerable; in, other words, c(A) will exist “ almost everywhere ” but not 
necessarily “ nearly everywhere.” This circumstance makes the result to be 
obtained irrelevant for the purposes of an harmonic analysis of ergodic 
systems.‘ In fact, for these purposes the following theorem was needed : 


(II) If f(t) possess an auto-correlation, then the Fourier average c(X) 


2 Cf. A. Wintner, “ On the distribution function of the remainder term of the prime 
number theorem,” American Journal of Mathematics, vol. 63 (1941), pp. 233-248. 

3A, Wintner, “On Riemann’s fragment concerning elliptic modular functions,” 
American Journal of Mathematics, vol. 63 (1941), pp. 628-634. 

*N. Wiener and A. Wintner, “ Harmonic analysis and ergodic theory,” American 
Journal of Mathematics, vol. 63 (1941), pp. 415-426. 


700 AUREL WINTNER. 


of f(t) exists for nearly all 4, provided that f(t) is either a bounded function 
or satisfies certain, more general, Tauberian conditions. 


It is not known whether or not the assertion of (II) remains true if no 
Tauberian proviso is made. No such proviso is made by the following lemma, 
which assumes very much less than (II) but makes a weaker statement 
than (II): 

(III) Jf f(t)|?} < 0, then c(d) exists for almost all 

(III) was proved by Karamata’ for the case of a sequence, where 
f(t) =f(n) forn<tSin+1;n=—0,+1,+2,---. He first verified the 
corresponding Abelian fact and then applied the fundamental Tauberian 
theorem of Hardy and Littlewood concerning the equivalence of the summa- 
tion processes of Abel and Cesaro in the slowly oscillating case. However, it 
turns out that bya simpler procedure much more than (III) can be established: 

(IV) Jf f(t) ts integrable on every finite t-interval, and if there exists 
a sufficiently small « > 0 satisfying 

T 1 
(3) | f(t) |?dt = O(T**) as &, 
J-T 


then the Fourier average, c(A), of f(t) exists and vanishes for almost all X. 


Notice that (1) is applicable to both (II) and (III) but not to (IV); 
in fact, the assumption of (III) is the particular case «1 of the assumption 


(3) of (IV). 
If h(t), 0OSt < o, is a function for which 


T 
q th(t)dt 


exists for every T > 0 and tends, as 7 — «, to a finite limit, then, since the 


= 


tends to the same limit, a partial integration shows that 


mean-value 


1 if 
h(t)dt->0 asToOo. 
T Jo 


Thus it is clear that, if g(t), — 0 < ¢< , isa function which is integrable 
on every finite t-interval and has the property that 


ON FOURIER AVERAGES. (01 


tends to a finite limit as T — o, then 


1 


which means that M;{g(t)} exists and vanishes. 
This Abelian remark, when applied to the function g(t) =e *f(t), 
where A is fixed, implies that (IV) is a corollary of the following theorem: 
(V) If a function <t< is integrable on every finite 
t-interval, and if there exists a sufficiently small « > 0 satisfying (3), then 


the improper integral 


1 f 1 
-1 t \ J-T -1 t 


as convergent for almost all x. 
The proof of (V) can be based on the following theorem of Plancherel : ° 
If a function /(¢) is not only of class (Z*) on the interval — ow <t< 


but satisfies the additional restriction 


( { — tog | < 0, 


then the Fourier transform 


200 
eiuth(t)dt 


is not only convergent in the mean (Z*) but exists for almost all wu. 


On placing in this theorem 
—u=d and k(t) = f(t)/t (if |¢| > 1), 


we see that, in order to prove (V), it is sufficient to show that 


by virtue of (3). In other words, it is sufficient to show that, if (3) is 
satisfied, 


id cor? 
f(t) |? = O(1) as T> 
1 


Hence, partial integration shows that it is sufficient to ascertain that 


5M. Plancherel, “Sur la convergence et la sommation par les moyennes de Cesaro 


..,” Mathematischen Annalen, vol. 76 (1915), pp. 315-326. 
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But this is obvious from 


d log?t logt 1—logt log? ¢ 
dt £2 £3 )-0(4). 


This completes the proof of (V). 
An Abelian corollary of the Abelian corollary, (IV), of (V) may be 


formulated as follows: 


(VI) If @ sequence of constants a,,a2,° - - satisfies the estimate 
(4) | dm |? = O(n?-€), n—> 0, 


for a sufficiently small « > 0, then the regular function defined by the power 
serves 


(5) F(z) an2", |z| <1, 

n=1 
behaves in the neighborhood of the circle | z| 1 in such w way that 
(6) F (ret?) =0(1—r)", r— 1, 


holds for almost all @; furthermore, this estimate remains valid for almost 
all }, 1f the radial approach is replaced by Stolz paths. 


In fact, if ¢:, c2,--* is a sequence of numbers for which the arithmetical 


mean tends to a finite limit 


aT Cn 
lim == (/, 


then, according to the Abelian theorem of Frobenius, 


co 
lim (1—r) car" = C, (r<1); 
n=1 


furthermore, 


lim (1— 2) = C, 


n=1 


if x is complex but is restricted to Stolz paths leading to r 1. Hence, on 


choosing 
C =0 and Cn = ane™, 


where ¢ is fixed, we see that, in, order to prove (VI), it is sufficient to verify 
the following statement: If a sequence a;,d2,- - - satisfies (4) for a suffi- 
ciently small « > 0, then 


702 
m=1 
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lm — = + an 


n—>00 


holds for almost all ¢. But this statement is equivalent to the one which 
results on formulating (IV) for sequences, instead of for functions. 
As an application of (VI), consider Lambert’s own series,® 
ae 
(8) L(z) = 3 —— =37(n), <i), 
1 


anh 
n=1 ~ & n= 


where r(n) denotes the number of the divisors of n. Thus 


(9) 1=r(n) ~O(1) 
and, as is well-known, 
(10) t(n) =O(n®*) for every fixed 0. 
It is also known that the circle | z | —1 is a natural boundary? of (8), and that 
the increase of (8) in the steepest direction, arg z = 0, is characterized by ® 
1 1 
L(r) ~ log r—>1—Q0O. 


However, the increase of (8) in the direction of a “general” ¢ = arg z is 


less steep; in fact one has, for almost all ¢, 


=O (; and even L(re'?) =o (; 


(if not a still better estimate). 
This is implied by (VI), since, as is seen from (10), condition (4) is 


satisfied for a, —7(n) by certain « >0 (in fact, by every positive « <1). 
On the other hand, (9) shows that Karamata’s condition, that is, the case 
e=1 of (4), is not satisfied. 

For more general Lambert series, (VI) leads to the following result: 


(VII) If a sequence of numbers ¢;,¢2,° * + satisfies 
(11 Cn = O(n'™) for a sufficiently small yn > 0, 
7 


then the boundary behavior of the regular function which ts defined for 
|z| <1 by the Lambert series 


6 J. H. Lambert, Anlage zur Architektonik ..., vol. 2 (1771), § 875. 

7 This is implied by Carlson’s theorem on power series with integral coefficients but 
can be verified directly. 

* Cf. the end of the paper by E. le Roy, “ Valeurs asymptotiques des certaines séries 
...,” Bulletin des Sciences Mathématiques, ser. 2, vol. 24 (1900), pp. 245-268. 
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gh 


12 F > 
( ) (2) n= 7 1 — 2" 
ts subject to (6) for almost all $; furthermore, the same estimate holds, for 
almost all }, even if the radial approach is replaced by Stolz paths. 


In fact, (11) insures that (12) represents a regular function for 
|z| <1. Furthermore, it is clear that the coefficients, a, of the power 


series, (5), of (12) are given by 


Ay = Ca, 
din 


where d runs through all the divisors of n; so that, from (11), 


On = 1 = = O(n?-™), 
din 


Since 7 > 0 and 8>0 can be chosen arbitrarily, it follows, by choosing 
0<$—7+8< 3b, that (4) is satisfied by some «>0. Hence, (VII) is 
implied by (VI). 

The only result known in the direction of (VII) seems to be an Abelian 
theorem of Knopp and Landau,’ which states that, if the coefficients of the 


Lambert series (12) satisfy the drastic assumption 


Cn 
(13) p> < 
n=1 
then 
00 (ret? (rei? ) n 
lim (1 — 1) lim (1—1r) a 
1 (re*? )* 1 (re°?)* 3 


along with the corresponding relation for Stolz paths, holds for every angle ¢; 
so that (6) holds for every irrational $/z. 

It is interesting that, roughly speaking, (13) is more stringent than (11) 
by the order of Vn. In particular, (13) is violated even by Lambert’s own 
series (8), where ad, 1. However, it does not appear to be known whether 
or not (6) is true for all irrational ¢/z, if (12) is given by (8). In fact, the 
problem of the “heaviest singularities,” being an arithmetical issue of “ major 
and minor arcs,” requires to-day an explicit analytical technique of the 
“nearly all,” and not the above Lebesgue approach, which leads only to the 
unspecified “almost all.” Correspondingly, the order Yn mentioned above 
can be thought of as a manifestation of the standard improvement of the 
apparent order in case of random distributions. 


THE JouHns Hopkins UNIVERSITY. 


® For a proof much simpler than that of these authors, ef. H. Spath, “ Uber Lam- 
bertsche Reihen,” Mathematische Zeitschrift. vol. 30 (1929), pp. 481-486. 


ON METHODS OF SUMMABILITY AND MASS FUNCTIONS 
DETERMINED BY HYPERGEOMETRIC COEFFICIENTS.* 


sy RALPH PALMER AGNEW. 


1. Introduction. We begin by introducing pertinent notation and results 
from a theory originated by Hurwitz and Silverman?! and by Hausdorff.2 A 
sequence-to-sequence transformation of the form 

AnkSk 

k=0 
commutes with (or is permutable with) the arithmetic mean transformation 
C, if and only if it has the form 


where Ao, Ai,° °° is a sequence of complex constants. The sum in braces is, 
in difference notation, A"”*A,. If A’ and A” are two transformations of the 
form A(A) generated respectively by two sequences A,’ and A,”, then the 
product transformation A* = A’A” is generated by the sequence An* = An/An” ; 
and the extension to products of three or more factors is obvious. If A, + 0 for 
each n, then A(A) has an inverse generated by the sequence A,’ of reciprocals of 
the elements of the sequence A,. One of the main reasons why transformations 
which commute with C, are of importance lies in the fact that two trans- 
formations which commute with C, commute with each other: this fact is 
proved and its significance is emphasized both by Hurwitz and Silverman 
and by Hausdorff. 

The transformation A(A) is conservative * if and only if a mass function 
x(t) exists such that 
(1) x(0) =0 


IA 


(2) x(¢) has bounded variation over 0S ¢ 


* Received December 30, 1940; Presented to the American Mathematical Society 
December 30, 1940. 

1W. A. Hurwitz and L. L. Silverman, *“ On the consistency and equivalence of cer- 
tain definitions of summability,” Transactions of the American Mathematical Society, 
vol. 18 (1917), pp. 1-20. 

2 F. Hausdorff, “‘ Summationsmethoden und Momentfolgen, I and II,” Mathematische 
Zeitschrift, vol. 9 (1921), pp. 74-109 and 280-299. 

® That is, such that the existence of lim s, implies the existence of limg¢,. 
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al 
(3) i™dy (t) (n = 0, 1, + =); 
0 


and is regular‘ if and only if x(t) exists such that (1), (2), and (3) hold 
and also 

(4) x(1) =1 

(5) x(#) is continuous at t —0. 


A mass function x(t) satisfying (1), (2), (4), and (5) is called a regular 
mass function; and a sequence A» for which a regular mass function exists 
such that (3) holds is called a regular moment sequence. 

The Cesaro method C,, r being a complex constant different from 


—1,—2,---, is generated by the sequence 
(6) 


The method C, is regular if and only if r—0 or 7’ = Rr> 0. Here and here- 
after z is used to denote the real part of a complex number z. The identity 
transformation is generated by the sequence A,’ —1, n ~0,1,2,---. 
The Holder methods H, are generated by the sequences (n + 1) and have 
the property H,H, = H,,;. The well-known fact that the Cesaro method C, 
and the Holder method H, are equivalent when 7” > —1 may be expressed 
by writing C, ~ H,. Since the Cesaro and Holder methods C, and Hg have 
the form A(A) and therefore commute, we can use this equivalence to obtain, 
when +s >—1, 


this implies that 
(7) +s’ >—1). 
It is a corollary of this result that C, @ Cg if —1l<a&w <p’. 
2. Hypergeometric summability. Corresponding to each set of com- 


plex constants a, 8, y for which 0, —1, —2,--- let a sequence An(@, y) 
be defined by Ao(a, B, y) and, when n — 1, 2, 3,- - 


(8) An(%, B,y) = 


These A’s are the coefficients in the familiar power series expansion 
SAn(a, B,y)z" of the hypergeometric function F(a, B8,y;z). The sequence 


‘That is, such that the existence of lim s, implies lime, = lim s,. 
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An(%, B,y) determines a transformation A(An(@,8,y7)) which defines the 
hypergeometric method of summability denoted by (H,«,B,y).° Using (8) 
and (6) we find that when a, B,y+0,—1,—2,:-- 


so that use of the theory of §1 gives the identity 
(9) (H, «, B, y) = 4-1 (a, 


The order of the factors on the right is immaterial since the factors commute. 
So much is known about the Cesaro methods C,, especially when 7” > —1, 
that formula (9) and formula (12) which we obtain below furnish an easy 
method of determining properties of (7,2, 8, y). In particular the results 
of the section on hypergeometric summability, pp. 195-201, in the paper of 
Garabedian and Wall become more comprehensible when interpreted in the 
light of (9) and (12). 
Using (9) and (7), we can show that 

(10) BY > 05 +B —2); 
in case B’,y’ > 0; > + —2; and 0 < =1 this follows from 

Y Y -1 ~ e 

C B-1 B-1 ( y-a-B+1 
and in case > 0; y + —2; and >1 it follows from 


Combining (9) and (10) we see that if 


then 
(12) (H, &, B, y) 


8. Regularity and moment sequences. If 


(13) >0; 
or if 
(14) > 0; y=2+ 68 —1, 


5H. L. Garabedian and H. S. Wall, “ Hausdorff summation and continued fractions,” 
Transactions of the American Mathematical Society, 48 (1940), pp. 185-207, p. 196. 
In this paper, the parameters a, 8, y are restricted to positive real numbers for which 
8 is an integer and y >a+f—l. 
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then Cy<-g.: is regular and (11) holds. Hence if (13) or (14) holds then 
(11, a, is regular and it follows that the generating sequence An (2, B, y) 
of hypergeometric coefficients must be a regular moment sequence. 

If 
(15) B’,y > 0; a+ —1, 
or if 
(16) yAa+f—1, 
then (11) holds and the transformations C,<-s,, and (H,2,B,y) are not 
regular and in fact not conservative. It follows that if (15) or (16) holds, 
then An(a, 8, y) is not a regular moment sequence and in fact is not a moment 
sequence. 

It can be shown easily by another method that if 


(17) a, B,y~0,—1,—2,° °°; <¢+ 
then (H, a, 8, y) is not conservative and An(a, 8, y) is not a moment sequence, 
Under the hypothesis (17), Stirling’s formula shows that 

| An (a, B, y)| = An*(1 + on) 
where A is a constant not 0, 6 = — (7’ — x — B’ —1) > 0, and 0, 30 as 
n—> oo. The sequence An(%, 8,y) is accordingly unbounded and cannot be a 
moment sequence, and hence (H, 2, 8,y) is not conservative. 


CORNELL UNIVERSITY, 
ITHaca, N, Y. 
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INFINITE GROUPS GENERATED BY EQUILONG 
TRANSFORMATIONS OF PERIOD TWO.*? 


By Epwarp Kasner and JoHn Der Cicco. 


1, Introduction. The problem of this paper is to determine the 
infinite groups generated by all equilong transformations of period two. In 
our preceding work,* we found that the set of all equilong transformations of 
period two may be classified into three distinct types: (7) equilong involu- 
tions, (7) K symmetries, and (7) D inversions. This is in contrast with 
the conformal theory where Kasner has proved that the set of all conformal 
transformations of period two consists of two distinct types: (J1) conformal 


involutions, and (¥.) conformal symmetries (Schwarzian reflections). 


We thus have five distinct types of transformations of period two: three 


equilong and two conformal. 


The conformal types and the infinite groups generated by them have 
been discussed by Kasner.* In this paper we shall consider the analogous 
situations in equilong geometry. 

Conformal transformations are correspondences between the 7? points 
of the plane which preserve or reverse the angle between the two directions of 
any two curves at®their common point of intersection. Equilong transforma- 
tions are correspondences between the «7 lines of the plane which preserve 
or reverse the distance between the two points of contact of any two curves 
along their common tangent line. Conformal transformations are defined by 
monogenic functions of the complex variable 7, + iy,, where 7? = —1 and 
(2%, y,) are the cartesian codrdinates of a point; whereas equilong transforma- 
tions are given by monogenic functions of the dual variable x + jy, where 
j?=0 and (2, y) are the hessian or equilong codrdinates of a line. 

We shall consider only those equilong transformations of the plane which 


* Received March 4, 1941. 

1 Presented to the American Mathematical Society, April, 1940. 

*Kasner and De Cicco, “ Equilong and conformal transformations of period two,” 
Proceedings of the National Academy of Sciences, vol. 26 (1940), pp. 471-476. 

’Kasner, “Infinite groups generated by conformal transformations of period two 
(Involutions and symmetries),” American Journal of Mathematics, vol. 38 (1916), 
pp. 177-184; Kasner, “ Conformal geometry,” Proceedings of the Fifth International 


Congress of Mathematicians, vol. 2 (1912), p. 81. 
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convert the positive y,-axis (with equilong codrdinates (0,0) ) into itself and 
are regular. Such regular equilong transformations are expressed by power 
series of the two forms 


(1) Z = %2 + G22? + +--+, 0:0, 
Z= O92? + a’ 328 + 0, 


where 7 = x — jy is the conjugate of the dual variable z = x + jy and the 
coefficients are arbitrary dual numbers of the form a+ jb. Separating (1) 
into real parts, we find 


X = + + +--+, = y(a; + + + ---) 
(2) + (b,x + boa? + +---),a, 0; 


The direct equilong transformations, as given by the first of equations (1) or 
by the first two of equations (2), form a continuous infinite group G. The 
reverse equilong transformations, as given by the second of equations (1) or 
by the last two of equations (2), do not form a group. However, if we add 
this set to our group G, we obtain a mixed group G@’. 

We determine all regular equilong transformations of period two. In the 
direct type Z =f(z) the functional equation is f[f(z)] ==, that is f? =1; 
and in the reverse type Z = f(z) the functional equation is f[f(z)] =z, that 
is ff= 1 where f denotes the series whose coefficients are the conjugates of 
the coefficients of the series f. Thus there are two types of functional equa- 
tions which are to be solved. The first type yields as a solution a set (7;) of 
equilongly equivalent transformations (besides the identical transformation). 
On the other hand, the second type yields as a solution two equilongly distinct 
sets (T.) and 

It can be shown that any equilong involution (7,) can be reduced 
to the simple form 7 =—z (ordinary symmetry in the positive y,-axis) ; 
that any K symmetry (7'.) may be changed to the simple form Z = Z (ordinary 


symmetry in the origin accompanied by reversal of orientation) ; and that any 


D inversion (7;) can be reduced to the simple form 7 = — 2 (ordinary sym- 
metry in the x,-axis accompanied by reversal of orientation). We observe also 
that the third type (73) is equivalent under the equilong group to Laguerre 
inversion. Of course, these simple transformations are considered to be corre- 


spondences between the lines of the plane. 
In our calculations, we find a striking resemblance between the roles played 


by (71) and (T;) while (T:) stands apart. 


er 


le 
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The equilong involutions generate an infinite group G’ invol? SHY trans- 
formation of this group can be factored into involutions in an infinitude of 
ways, of which at least one will contain four or fewer factors. The trans- 
formations which are products of an even number of equilong involutions 
of G’ 


form a continuous subgroup 

Any K symmetry may be defined as the unique reverse equilong trans- 
formation which leaves fixed the tangent lines of a given curve (.* In this 
respect, K symmetry may be considered to be the dual of conformal symmetry 
which may be defined as the unique reverse conformal transformation which 
leaves fixed the points of C. The group K’, generated by all K symmetries 
consists of K symmetries and K translations (the products of two K sym- 
metries). Any transformation of this group sends any line into one parallel 
to itself such that the distance between any two parallel lines is preserved or 
reversed. The K translations form a continuous subgroup K _ of ia 

Finally we shall discuss the group D’.. — enerated by all D inversions. 
Any transformation of this group can be factored into D inversions in an 
infinitude of ways, of which at least one will contain four or fewer factors. 
If the number of factors is even, the resulting transformations form a con- 
tinuous subgroup of 

Our fundamental result is that the group generated by all equilong 
transformations of period two is identical with the group generated by all K 
symmetries and D inversions. In particular the groups eas eis and 
Dy are all subgroups of this larger group. See the exact formulation of 


invers 
the final group in Theorem 18. 
2. The discussion of (T,) equilong involutions, (T.) K symmetries, 
and (T;) D inversions. First let us note that any direct equilong trans- 


formation may be written in either of the two forms (see *) 


Since an equilong involution is a direct equilong transformation of 


period two, we obtain from the preceding equations the following result: 


THEOREM 1. Any equilong involution is given in the implicit dual 


variable form by 


(4) + a,(Z —z)* + a (Z 


*Kasner, “ Equilong symmetry with respect to any curve,” Proceedings of the 
National Academy of Sciences, vol. 26 (1940), pp. 287-291. 


_] 
? 


EDWARD KASNER AND JOHN DECICCO. 
where the ao» are arbitrary dual numbers. In the implicit real form, any 
equilong involution is 


(5) —z)? + a,(X —2)* 
Y + y= (¥ —y) [2a.(X — + 4a,(X — x)? + 6a,(X —2z)5 
+ [b2(X —x)* + + 


Any equilong involution in the explicit dual variable form is 


where the yon are arbitrary dual numbers. Finally in the explicit real form 
any equilong involution is 


(6) — 2 + 


X =— — + + (— BAA, + 
(7) y[— + 21.7 — 4\,2° + 5(— BAA, 
+ [dox? — + dyat + (— — + ++ 


In the power series expansion for X, the odd coefficients are polynomial 
functions of the arbitrary even coefficients. In the power series expansion of 
the part in Y not containing y, the odd coefficients are polynomial functions 
of the even coefficients in this part and of the even coefficients in Y. 

Next let us note that any reverse equilong transformation may be written 
in either of the two forms 


+2) + +2)? + +2)? +: 
y1(Z—z) + + ys(Z—2z)? H1. 


Now any K symmetry may be defined as a reverse equilong transforma- 


x 


tion of period two, which may be written in the form given by the first of the 
preceding equations. Hence 

THEOREM 2. ‘Any K symmetry is given in the implicit dual variable 
form by 
(9) +2) +b.(Z +2)? +b3(Z +2)? 
where the b’s are real numbers. In the real explicit form, any K symmetry 
may be written as 


(10) X =2, =—y4+ 2(die + doa? + 
In the explicit dual variable form, any K symmetry is 
(11) Z = 2—2j(diz + doz? + 


We observe that the K symmetry (10) or (11) may be defined as the 
unique reverse equilong transformation which leaves fixed the tangent lines 


of the curve y = + + 


f 
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Finally any D inversion is a reverse equilong transformation of period 
two, which may be written in the form given by the second of equations (8). 
Therefore 


THEOREM 3. Any D inversion is given in the implicit dual variable 
form by 


(12) [a2 (Z 2)? 
+ j[b:(Z—2z) + 


z)* + ae(Z—z)* +- 


where the a’s and b’s are arbitrary real numbers. In the implicit real form, 


any D inversion. may be written as 


X + — az)? + a,(X —z)* + —z)*+---, 
(13) F¥—y=(¥ + y) [2a.(X — x) + 4a,(X + +---] 
— [b.1(X —z) + + b5(X 


In the real explicit form any D inversion is 


(14) y[—1 + — + + 5(— Brody + 
+ [dia + 


+ 5/2 (— — Aad, 2d2°d;)a* + 


In the power series expansion for X, the odd coefficients are polynomial 
functions of the arbitrary even coefficients. In the power series expansion of 
the part in Y not containing y, the even coefficients are polynomial functions 
of the odd coefficients in this part and of the even coefficients in X. 

Thus we see that in the K symmetry (10), all the d’s, both odd and even, 
are arbitrary, but in the D inversion (14) the answer depends on the even A’s 
and the odd d’s. 


3. The reduction of equilong transformations of period two to 
canonical forms. First we shall demonstrate the existence of at least one 
real function (A, 0), such that + = 0, where 
X is given by the first of equations (5). That is, we must have 


(15) + Ao(X? + + As(X* + 
+ + xt) 0,10. 


Upon making the substitution ¥ — x = ¢, we find that we can solve this and 
the first of equations (5) for 2 and X in terms of ¢ obtaining 

(16) (t+ at? +--:). 
Substituting these values of x and X into (15) and equating the coefficients 
of the different powers of ¢ to zero, we obtain the system of equations 


2 
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+ + + = 0, 
1 2n—1 
where Lon is a linear function of (A, A2,* * *,Aen-2) with coefficients which 


are polynomials in (@2,44,° - *,@2n). This system of equations shows that 
all the even coefficients As» are determined as linear functions of odd coeffi- 
cients Azn-, such that the coefficients of these linear functions are polynomials 
in the dan. Thus there exist many functions ®(x) with the desired property. 

Under the direct equilong transformation X = @(xr), Y = y@,(x), our 


equilong involution (5) becomes X Y =—y-+ 2(d.7? dyx* 
The transform of this under the direct equilong transformation X —z, 
Y = y— + - -) is the equilong involution =—vz, Y =—y 
(or Z =—2). 


Under the direct equilong transformation XY Y = y— + 
+ d,7°+---), the K symmetry (10) is reduced to the K symmetry Y =z, 


Y =—y (or Z =2). 

Under the direct equilong transformation XY = @(r), Y = y@,(2), our 
D inversion (13) becomes Y = y+ 2(dix + 4+ 
The transform of this under the direct equilong transformation X —g, 
Y=y+ dz? +d;a>+---) is the D inversion Y=y 
(or Z =—2Z). Therefore we have proved the following theorem: 


THEOREM 4, Any equilong involution can be reduced to the symmetry 
through the positive y,;-arvis: Z=—z. Any K symmetry may be changed 
to the symmetry through the origin accompanied by reversal of orventation: 
Z =. Any D inversion may be reduced to the symmetry through the x,-auis 


accompanied by reversal of orientation: Z = — zi. 


It may be shown without much difficulty that these three sets of equilong 
transformations of period two are (equilongly) mutually exclusive. 

4. The infinite group G’, , generated by equilong involutions. The 
first question which we wish to consider is that of finding the form of those 
equilong transformations which can be obtained as the product of two equilong 
involutions. Given a function f(z) we inquire whether it is possible to find 
two functions g(z) and h(z) such that, symbolically, f—=hg, g?=1, h° =1. 
If we write the transformations f, g, h in the real explicit forms 


oS 
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f:X=—f(x), Y=yfe(z) + F(z), 
(18) 9g: X=—$(x), Y=yde(x) + (2), 

h: + ¥(c), 
our problem is to find the functions ¢, y, ®, ¥, which satisfy the conditions 
for equilong involution 


(19) = 2,06) —2,¥(6) 


and which satisfy the two relations 


(20) = f(x), + ¥(¢) = F(z). 

First we shall proceed to discuss the equation ®(¢) =f(z). Since $ 
and ® are involutorial, the leading coefficient of f(a) must necessarily be 
unity. Our question is that of finding when the equation 


(21) X=—2r+ e277 + + 
can result from the elimination of the variable wu from, the two equations 


defining two arbitrary involutions. 

It is convenient to write e, = 2*e,, and to introduce t=a2—vu. Elimi- 
nating x, u, X from this and the preceding three equations, we obtain the 


following identity in the single variable ¢ 


(23) + 
+ Ao (t + + + a 


where P = ¢ + a.t? + ayt* + agt® +---. Arranging the preceding equation in 
powers of ¢ and equating coefficients, we obtain an infinite system of equations 
which must be discussed. 

Let us note that 


+ + 6 Pt + + + + 6P§ 
= eof? + (2doe. + + + + €,) 

+ + + €;)t° 
+ (Ba, + ao? + + Saves + 
Ages + + (4a, + + 10a27€; + 6a2€5 + e; |t" 
[ ( + ay? Jeg + (86 + + (12a2a, + 48 
+ (Say + 10a.*)e; + + Tazez + €s 

+ + 3a47)eg + + eg + (200204 + 49 
[ + (6a, + + + + 


9 


| 
+ 
(24) + | 
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Substituting this into (23), we see that the equations corresponding to 
and are 


This gives a necessary relation — 2e.2 0, or — = 0. 

Let us assume that e, 0 so that «+0. The equations corresponding 
to 7?" and 7¢*"*1 are of the form 
(26) don — Aon = Pon (Qe, * * * * » Aen-2), 

R€2 (don — NAon) == Ponsi(2,* * * on-23 * *,Aen-2), 

where Pon and Pons; are polynomials in the preceding a’s and A’s. Therefore, 
if n > 1, we can solve these equations for don and Asn as polynomials in the 
preceding a’s and A’s. Since for n =1, our equations (25) show only that 
As = Ao + €2 provided that e; — e.* = 0, it follows that all the coefficients doy 
and Az» of our two real involutions (22) can be determined as polynomial 


functions of Thus 
THEOREM 5. Lvery real transformation of the form 


(27) + + +--+, e;—e.? =0, 0, 


can be factored into two real involutions. This can be done in c* ways. 


Suppose now that e.=0O(e.—0). The equations (25) show that 
=0 (e,—=0) and The equations corresponding to ¢*, ¢°, 


and ?® are 


+ e5 0, Ag — + Bare; + €5 Ag, 
4 (as — Aq — + 4dveg + — 0, 
(4a6 — 6Ag — — — 

+ — 20d2A, — 15a2* 

+ (6a, — 4A, — 10as*)eg + + Caves + 


(28) 


The first, second, third, and fifth of these are equivalent to the equations 
+- == 0, = i, As = — €6; 

Upon substituting the above values of a, and A, in the fourth of equations (28), 
we obtain the necessary relation 
(30) 4e,* EG €47€7 = 0. 

Let us now assume that e,s£0 so that «,540. The equations corre- 
sponding to 7?” and #"** are of the form 
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Res (2d2n — NAon) = Ponsa (2,° * on-23 * Aen-2), 


Gon — Aon = Pon(Ge2,* * * * 
(31) 

where Pon and fon,z are polynomials in the preceding a’s and A’s. Thus for 
n > 3, we can solve for dz, and Azn as polynomials in the preceding a’s and X’s. 
But the equations (29) show that Ax = a2 is uniquely determined and that 
ds, Ue, Ag are determined as polynomials in a,. Therefore all the coefficients den 


and Az» of our two real involutions (22) can be determined as polynomial 
functions in a,. Thus 


THEOREM 6. Lvery real function of the form 


(32) X =2 + + + + es ~ 0, 
such that 
(33) — + — = 0, 


can be factored into two real involutions. This can be done in + ways. 


A new type arises when es = 0, and so on. The final result is 


THEOREM 7. All real functions which can be obtained as the product of 
two real involutions are of the form 


where k =1,2,3,- ++, and a single rational relation 
(35) ( Cok, = 0, 
holds between the coefficients. This can always be done in «+ ways. 


The form of this relation, as well as the number of coefficients involved, 
changes with the integer /, but in all cases e..-, can be expressed as a rational 
function of the previous coefficients. 

Next we proceed to discuss the equation ybo(¢) + ¥(¢) = F(x). The 
problem is to determine the bo» and dz» such that the equation 
[b.(a—w)? + + 2A.(u—X) + 4Ag(u— X) 

= [1+ 2a.(~—u) + 
[1+ 2a.(u—X) + [fia + fea? + fav? + 


where w and X are defined by the equations (21) and (22) is identically 


(36) 


satisfied. Let us write « — 2*e, and m = 2*f;, and introduce the substitution 
t=-x—u. Upon eliminating z, u, X from this and the equations (21), (22), 
and (36), we obtain the saevite equation in the single variable ¢ 


i 
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— [b.t? + ++---][1+ 2a.(¢ + + 
+ (t + + + + 4 [1 + + 4 
(37) x + + + da(t + + + + 
= [1 + 2a.t + 4a,t + ---](1— 2dr.(t + oP? + 
— 40,(¢ + eP? + +--+)? 
x [mP + + + 


where P = ¢ + ast? + att +---. On arranging the above equation in powers 
of ¢ and equating coefficients, we obtain an infinite system of equations which 
must be discussed. 


The three equations corresponding to ft, ¢?, and ¢° are 


38 
— + + ods yz + — 


These equations with (25) give as necessary relations 7, = 0 and y3 — 4esn2 = 0, 
or fi 0 and fs = 0. 
Let us assume that e240 so that e«, 40. The equations corresponding 


to #?% and ¢?"*! are of the form 


(39) — 2ebon + + don = (b2,° Don-2; *, den-2), 


where I, and Js»,, are linear polynomials in the preceding b’s and d’s. There- 
fore if n > 1, we can solve these equations for be, and ds, as linear poly- 
nomials in the preceding b’s and d’s. Since for n —1, our equations (38) 
show only that dz = bs + mp, we find that all the ben and den can be determined 
as linear polynomial functions of b.. The coefficients of these are polynomial 


functions of a. Hence 


THEOREM 8. Lvery equilong transformation in the real form 


(40) Y =y(1 + + +--+) + (few? + +---), 
fs — 2eof2 == (), 


or in the dual variable form 


(41) tyne te 
can be factored into two equilong involutions. This can be done in ©? ways. 


Assume next = 0. Then —0 and f; Thus we have the con- 
ditions = 13 = 0. The equations corresponding to 2°, ¢*, ¢°, and ¢* are 


GROUPS GENERATED BY EQUILONG TRANSFORMATIONS. 


— bo + 2, —bs + dy = 4 — 30272, 
2d2(— bg + ds) — + (404 + 25) de 
= 95 + 4dons + (2a, — — 4€5) 72, 
— be + dg — + (12a2€, + 2e5) de 
(42) =o + + 227m + (— 140204 — 18a2€, — 402*) 
22 (— be + de) — + 4( a4 + €4) ds 


By means of equations (29), the preceding equations are equivalent to 


dz=be+ yo, dy + 94 30270, 
— 2e,b. = 5 + — (2a, + 2a.2 + Ges) 
(43) dg = De + ye + + — (20a2a, + 10 aot + 
4egbs — nz — — 12 + (— 16a.° + 
+ (4ag + 32a27a, + 24a.° + + 


By equations (29) and (43), we obtain the necessary relation 


44 + — — 16€4°ns — 


+ 5es°”n2 — = 0. 


Let us now assume e,5<0 so that e, 40. The equations corresponding 
to 12", #2"*?, and are of the form 


— Don don = lon (D2,° +, Don-43 * dene), 
— Dense + dense = de,* *,den-2), 
2a2(— Donse + dense) — 4Asbon + (404 + 2nex) don 


where lon, lonso, and lon,, are linear polynomials in the preceding 6’s and d’s. 
These equations show that if'n > 2, the bon and ds, can be determined as 
linear polynomials in the preceding 6’s and d’s. This fact combined with 
equations (43), show that all the be» and dz, can be determined as linear 


polynomials in b, and polynomials in a,. Thus 
THEOREM 9. Lvery equilong transformation of the form 


X + + + + 040, 
(46) Y=y(1 + + + + +s) 
+ (fou? + frat + 


where the two rational relations 
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8e,* — e5° + — = 0, 

Rea fz — 4es*esfe — 4es*eofs + + eseofs — 
— 464°f, — + — — 6eses* 
+ + 5e5°f. — 34e,%¢5"f. = 0, 


hold between the coefficients, can be factored into two equilong involutions, 
This can be done in oc? ways. 


(47) 


A new type arises when e, 0, and so on. The final result is 


THEOREM 10. All equilong transformations which can be obtained as 
the products of two equilong involutions are of the form 


(48) Y=y(1+ + (2k + 1) 
(2k 2) + (fox? + + + ‘), 


where the k rational relations 


( Cox: Cak-1) = 0, 
Sx (Cox, Cok+1y° ° 5 C4k-15 fax-1) 
+ ti: ( C2k+19° ° fe, fa fox-2) = 0, 
fs =f (Cox, Coks1, Coks2, Coxss3 fo, fa), 
= fz (eax, * Cokss 5 fe, fas fe), 
fom-1 = fox-r(Cox,* * Cax-sy fo, fox-2)s 


(49) 


hold between the coefficients. This can always be done in «7 ways. 


The forms of these & relations depend on the integer k, but in all 
cases €4;-, can be expressed as a rational function of the preceding e’s, the 
fs, fr° * *5f2x-1 can be obtained as rational functions of the preceding e’s 
and even f’s, and f4_, can be determined as a rational function of the pre- 
ceding e’s and f’s. 

By Theorem 10, we find that for each value of the integer i, there is a 
single set of equilong transformations which can be factored into two equilong 
involutions. No one of these sets has the group property. The same is true 
of the totality of all the sets. 

All transformations in question are, however, included in the larger class 


(50) yet? + ys +: — y2 = 0, 


which does constitute a group, as may be verified immediately. 

A simple example of a transformation of the group (50), which is not 
of the form specified in Theorems 8, 9, or 10, and hence cannot be factored 
into two equilong involutions is Z—z-+- z*. We shall now show, however, 


‘ 

( 
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that every transformation of the group (50) can be factored into four or 
fewer equilong involutions. 

If, in (50), y2 does not have its real part zero, we already know by 
Theorem 8 that two equilong involutions are sufficient. Consider, therefore, 
any transformation 7 of our group (50) for which the real part of y2 does 
y2? =0. We 
can factor T into two transformations T’ and 7” both of the form (50) and 
such that the real parts of the coefficients y2’ and y2” do not vanish. This is 
seen from the fact that the product 7’T”’ is of the form z+ (y2’ + y2”)2?+::-, 
and hence the real part of the coefficient of z* can be made to vanish without 


vanish. Of course, then y; 0 on account of the relation ys; 


taking either of the real parts of y.’ and y2” equal to zero. We already know 
that T’ and 7” can be factored into two equilong involutions. Hence 7 can 
be factored into four equilong involutions. 

If we multiply (40) or (41) by a general equilong involution, we obtain 
a type which can be factored into three equilong involutions. We thus have 


completely proved the following proposition. 


THEOREM 11. The only transformations which can be obtained as 
products of equilong involutions are of the two types 


(51) °°, Ys — = 0, 


These form a mixed group which we denote by G’,,.,. This ts the group 
generated by all equilong involutions. 


5. The infinite group K’. generated by all K symmetries. Any K 


symmetry may be defined as the unique reverse equilong transformation which 


sym 


leaves fixed the tangent lines of a given curve C. When this curve C is given, 
the direct construction of our K symmetry is easy. Let / be any oriented line 
in the plane. Construct the tangent line ¢ of the curve C which is parallel to l. 
The correspondent LZ of J under our K symmetry is the line parallel to both 


1 and ¢ such that ¢ is midway between J] and LZ. Thus 


THEOREM 12. The construction of K symmetry with respect to the curve 
C is accomplished by means of ordinary symmetry in the parallel tangent lines 
of C. 

The product of two K symmetries is not a K symmetry. We shall call 
any such correspondence an equilong translation. Concerning the group gen- 
erated by all K symmetries, we discover the following proposition. 


THEOREM 13. The group generated by all K symmetries 1s a certain 


mized group K’_ consisting of K symmetries and K translations, expressed 


sym 


as follows 


| 
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52) XxX Y=ayt+ (d,a + + +: 
In the dual variable form, this may be written as 
(53) Z or Z=2 j(dyz + doz? + daz? 


In either form, the first correspondence represents a K symmetry and the 
second a K translation. 


Any transformaton of this group may be defined as a correspondence 
which carries any line into one parallel to itself and which preserves or reverses 
the distance between any two parallel lines. Of course, this group K’. 
contains the continuous group of equilong translations as a subgroup. 


6. The infinite group D’. 


savers Qenerated by all D inversions. The 
next question that we wish to consider is that of finding the form of those 
equilong transformations which can be obtained as the product of two D 


inversions. Given a function 7 = f(z), we inquire whether it is possible to 


find two functions Z = g(z) and Z =h(z) such that symbolically, f = hg, 


99 =1, hh =1. If we write the transformations f, g, h in the real explicit 


forms 
Y—yfe(z) + F(z), 
(54) g:X=¢(r), Y + 
h: X=@(r), Y=—y®,(r) + V(z), 


our problem is to find the functions ¢, y, ®, ¥, which satisfy the conditions 
for D inversion 


v 


and which are such that the two relations 
(56) O(¢) =f (xr), + = F(x), 
are identically satisfied. 

We have already discussed the equation @(¢) = f(z) in connection with 
the corresponding problem about equilong involutions. Therefore it only 
remains to discuss the equation —yo(¢) + ¥(¢) = F(x). The problem 
is to determine the Don-, and dz»_, such that the equation 


— [b,(7— u) + b3(a— u)* 
+ [d,(u—Z) + 
(57) [1 + 2a2(a@— wu) + 4ay(a@— 
= [1+ + +- 
[1 + 2ae(u— X) + 
x [fw + + 
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where w and X are defined by the equations (21) and (22), is identically 
satisfied. Let us write and = 2*f;, and introduce the substitution 
=x—u. Upon eliminating z, u, X from this and the equations (21), 
(22) and (57), we obtain the following equation in the single variable ¢ 


— [b,¢ + 
K [14+ + 44(¢+ ef? +6P? 


[1 + 20,t + 40,f* --] 


= [1+ 2at + 4a,f*+---] 
x + 2rA.(¢ +:---) + 4A, (t+ 6P?+ 


where P = + + + Arranging the above equation in 
powers of ¢ and equating coefficients, we obtain an infinite system of equations 
which must be discussed. 
The two equations corresponding to ¢ and ¢? are 
(59) —b,+d,=—=—n, — 2A.b, + d,(2a2 +> €o) = — no + m(— a2 
The equations corresponding to ¢?""? and ¢*" are of the form 
— + dens = lens (01,° Don-s3 don-s), 
(60) — + [2a2 + (2n —1)ee|don-s 
The above two sets of equations demonstrate immediately that, if «0, all 
the Do», and do»-, are uniquely determined as polynomials in 4a». 
THEOREM 14. Every equilong transformation in, the real form 
T+ ¢ + + e,2* Q, — 0, 


(61) 


y(1 + + + + + (f,2 + + 


or in the dual variable form 


(62) (1+ ye +: 


Yn = Gn C2 
can be factored into two D inversions. This can always be done in «1 ways. 
Assume next e2 = 0 so that e; = 0, or =e; = 0. The equations corre- 
sponding to ?¢, ¢*, and are 
—b+da,= b,+d; = — ns + 
(63) 2a.(— b + —4A4b, + (e, + 4a,)d, 
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The equations corresponding to ¢?"-1, ¢2"*1, and ¢2*? are of the form 


— Don-1 + don-1 = lon-1 (b,, Don-s; d,, don-s), 
— Dons + dons: = longs (bi, don-3), 


(64) 2s dans: ) 


where lon_1, lons1, lon: are linear polynomials in the preceding b’s and d’s, 
These two sets of equations show that all the ben, and don, are uniquely 
determined. 


THEOREM 15. Every equilong transformation of the form 


(65) Y= y(1 4¢,2° 4. -) 
+ (fia + + faa? +--+), 


where the two rational relations 
(66) 8e,* —— Cs = 0, esfi == 0, 


hold between the coefficients, can be factored into two D inversions. This 
can always be done in «+ ways. 


New types arise when es = 0, and soon. The final result is: 


THEOREM 16. Ail equilong transformations which can be obtained as 
the products of two D inversions are of the form 


(67). Y= y(1+ + (2k + +--+) 
+ (fia + for? + fra? -), 


where the k rational relations 


( Cox, 5 Caker) =Y, fe <= fo(e€ox, C2k+1 5 fi ) 
(68) fs fs (ox, Cok+15 Cok+25 Cok+3 5 hi; fs 


for-2 fox-2 (Cok; 5 Cak-3 5 fi, fs; fex-s ) 
hold between the coefficients. This can always be done in ~* ways. 


By the preceding theorem there is, for each value of the integer k, a set 
of equilong transformations which can be factored into two D inversions. 
No one of these sets has the group property. The same is true of the totality 
of all these sets. By an argument similar to the one used in proving Theorem 


11, we obtain the following result: 


THEOREM 17. The only transformations which can be obtained as 
products of D inversions are of the two types 
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(1+ €3 — = 0; 
where yn = €n + jfn and = + jf'n. These form a mized group D’ 
This is the group generated by all D inversions. 


(69) 


If the group K’_ , given by equations (52), is multiplied by the group 


sition is obtained: 


given by the preceding equations, the following fundamental propo- 


THEOREM 18. The group generated by all equilong transformations of 
period two is 


Z or Z= (14 jfi)z + yor? + = 0, 
Zor Z=—(—1+4+ ester? =0, 


3 


where yn = én + jfn and y'n = + Any transformation of this ‘group 
may be factored into K symmetries and D inversions in an infinitude of ways, 
of which at least one will contain either one K symmetry and four or fewer D 


inversions, or four or fewer D inversions. 


It is observed that the equilong theory as developed in this paper differs 
very essentially from the corresponding conformal theory. Thus in the equi- 
long theory both the K symmetries and PD inversions are needed to obtain the 
entire group generated by all equilong transformations of period two, whereas 
in the conformal theory the total group generated by conformal transforma- 
tions of period two can be factored into conformal symmetries only. 
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UNIVERSAL FUNCTIONS OF EXTENDED POLYGONAL 
NUMBERS.* 


By L. W. GrirritHs. 


1. Introduction. Certain universal functions of extended polygonal 
numbers were obtained in an earlier paper.’ In the notations of that paper m 
is a fixed but arbitrary integer = 3. The extended polygonal numbers are 
the integers 
(1) e(xz) =— 2+ m(2°+ 2)/2, (cz =—1,0,1,2,- °°). 


All universal functions were determined if m = 3, 4,5, with the sum of 
the coefficients in a function = m+ 1. The reason for this condition on the 
sum of the coefficients was stated in that paper. All functions which might 
be universal were determined if m = 6, with the sum of the coefficients in a 
function =m. Certain of these functions were proved universal. In this 
paper the remaining functions are proved universal. Thus all universal func- 
tions of extended polygonal numbers, with the sum of the coefficients S m, 


have been determined. 


2. The universal functions if m=6. The following notations are 
restated from the earlier paper on extended polygonal numbers. The integer 
m is fixed, whereas n, @,,2,° °°, are positive integers to be determined, 
For k =1,---,n the e, are extended polygonal numbers e(a;,). The function 
f=apit---+anpn is said to represent the positive integer A when 
Pn can be chosen so that a,p, +--+ =A; and f is said to be 
universal when f represents every positive integer. Also 


It was proved that no function represents the integers 0,1, 2,- - -,3m—4 
if wn < m. It was also proved that if w, =m then f represents the integers 


0,1,2,---,6m—8 if and only if f is one of the following functions: 


(2) 


* Received April 9, 1941. 
1L. W. Griffiths, “ Representation by extended polygonal numbers and by generalized 
polygonal numbers,” American Journal of Mathematics, vol. 55 (1933), pp. 102-110. 
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UNIVERSAL FUNCTIONS OF EXTENDED POLYGONAL NUMBERS. (2% 


IIA 


THEOREM. [f f satisfies (2) or (3) or (4) then f is unwersal. 


The fact that f is universal if f satisfies (2) was proved in the earlier 
paper on extended polygonal numbers. In Theorem 2 of that paper it was 
also proved that if f satisfies (3)' then f represents all positive integers A 
except perhaps those such that 105m — 14 < A < 296m — 80. In that same 
theorem it was further proved that if f satisfies (4) then f represents all posi- 
tive integers A except perhaps those such that 105m —14< A < 513m + 210. 
It has been proved now that if f satisfies (3) then f represents each integer A 
such that 105m —14 < A < 296m — 80, and that if f satisfies (4) then f 
represents each integer A such that 105m —14< A < 513m + 210. 

The general methods of proof will be illustrated for integers A such that 


151m — 22 < A < 153m — 23. Define f. = (a3,---,@n), and write bj = di42 
(i=1,---,n—2) andb,+---+); —w’;. Now represents the integers 
0,1,- +, w’n-2, because, as noted at the bottom, of page 104 of the earlier 


paper on extended polygonal numbers, the lemmas 1, 2 of another paper? are 
valid for extended polygonal numbers. Hence f. represents the integers 
0,1,---,m—2. But obviously f = a,e, + + f. and a, =a,—1. Hence 
f represents the integers e, + @., e: If 
is taken to be the value e(16) — 136m —16 of (1), and if e, is taken to be 
the value e(5) =15m—5, then it is seen that f represents the integers 


151m — 21,- - -, 152m — 23. 


Again, if f, is defined to be (as,---,a@n) and if the notations cj = Qis3 
(i=—1,---,n—3) and w/’—c,+--+-+c are used, then it is seen by 
Lemma 1 of the paper just mentioned that f,; represents the integers 
0,1,- °°, except perhaps Since +1, 
C=, and w”i.;—Wiss, it is true that f,; represents the integers 
0,1,: - -,m—3 except perhaps a;,;—1-+ Ai+s. Hence f represents the 
integers + + es, @: + eotes+m—d except 
perhaps e, + + + diss —1 + A;.,. If e; and are taken as before, 
and if e. is taken to be the value e(1) = m —1, then it is seen that f repre- 
sents the integers 152m — 22,- - -,153m— 25 except perhaps 152m — 22 
+ —1+ Now e(4) =10m—4 and e(3) =6m—=3. There- 


fore e(16) + e(4) + e(3) = 152m —23. Hence this possible exception 
2. W. Griffiths, “ A generalization of the Fermat theorem on polygonal numbers,” 
Annals of Mathematics, ser. 2, vol. 31 (1930), pp. 1-12. 
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152m — 23 + diss + Aiss is actually represented by f with e, = e(16), 
= = e(3), Cg = e(0) =0, = e(—1) —1, and 
the remaining e; either 0 or 1 according as a; does not or does appear in Aj,,. 

Finally, that f represents 153m — 24 is proved as follows. Note that 
e(15) = 120m — 15, e(6) = 21m — 6, e(14) = 105m — 14, e(8) = 36m — 8, 
e(?) =28m—%. Then if f satisfies (4) with a;—4, the equation 
153m — 24 = 105m — 14 + 36m — 8 + 2(6m —3) + 4 indicates that the 
choice e; = e(14), ¢2 e(8), es =0, eg —e(3), es =1, = en = 0 
gives a representation of 153m — 24. The equation 153m — 24 = 120m — 15 
+ 21m — 6 + 2(6m—3) +3 is used if f satisfies (4) with a; = 2 or 3. 
The equation 153m — 24 = 120m — 15 + 28m—7+ 5(m—1) +3 is used 
if f satisfies (3) with a, = 5. But if f satisfies (3) with a, = 2, 3, or 4, then 
the first equation is again used. It remains to represent 153m — 24 by the 
function (1,1,1,1,2) of (3). Here m=6, and therefore 153m — 24 
= 154m — 30 = 91m — 13 + 45m — 9 + 15m —5 + 3(m—1) indicates the 


representation. 
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RELATIVISTIC EQUATIONS OF MOTION IN ELECTRO. 
MAGNETIC THEORY.* 


By P. R. 


Introduction. It has been found that under certain simplifying assump- 
tions it is possible, on the basis of Maxwell’s equations and the relativistic 
field equations, to obtain a closed form of the equations of motion of charged 
particles represented as singularities of the field. The significant assumption 
made in order to obtain this result was that gravitational effects (i. e., those 
arising from non-linear terms involving the masses of the particles) were 
negligible in comparison with “ purely electromagnetic ” effects. In the present 
paper this assumption is dispensed with and a general formulation of the 
equations of motion is obtained. It is found in this case that it is no longer 
possible to express these equations in a closed form, and that recourse must be 
had to the “new approximation method ” which has been used previously to 
derive the gravitational equations of motion from the relativistic field equa- 
tions.” * This makes it possible to obtain a series of successive approximations 
to the equations of motion based essentially on the smallness of the velocities 
of the particles in comparison with the velocity of light. 

We shall attempt here only the problem of two bodies under mutual 
interaction. Further difficulties arising from the extension to n bodies are 
formal only, and not fundamental. The equations of motion will be solved 
up to a specified approximation for two oppositely charged particles, and the 
results compared with those obtained by others. 

The general outline of the method is based on E. I. H. and E, I., and the 
calculation of the equations of motion follows the lines indicated in these 
papers except for the parts concerning radiation, which derive from the paper 


* Received March 27, 1941. 

1Infeld and Wallace, Phys. Rev., vol. 57 (1940), p. 797. 

2 Rinstein, Infeld and Hoffmann, Ann. Math., vol. 39 (1938), p. 65. 

* Einstein and Infeld, Ann. Math., vol. 41 (1940), p. 455. The first of these papers 
will be referred to hereafter as E. I. H., the second as E. I. 

‘This problem has been worked out by the author in a thesis “On the Relativistic 
Equations of Motion in Electromagnetic Theory,” University of Toronto, 1940. This 
thesis, in which the details of the calculations involved in the present paper may be 
found, will be referred to as T. 
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on gravitational radiation by Infeld.® For the sake of brevity and convenience, 
specific references will be made to all these papers in the exposition to follow. 
The calculations involved in the solution of the equations of the two-body 


problem are very similar to those of Robertson ® in the gravitational problem, 


1. Derivation of the equations of motion. We shall start from field 
equations which differ from those of E. I. H. by the inclusion of the electro- 
magnetic energy-momentum tensor, i. e., 


(1.1) Ruv — dgpvR + = 0, (p, y == (0), 1, 2,3), 
in the usual notation, x being the gravitational constant. T'yv is defined by: 


The velocity of light will be taken as unity throughout. The electromagnetic 


field tensor Fyp may be expressed in terms of the four-potential yv: 


(1. 3) = yv;p — Yp:v = — 


where the notation “;p ” indicates covariant differentiation with respect to 2, 
the codrdinates of space-time, and the stroke indicates ordinary differentiation 


with respect to a codrdinate. 
To the equations (1.1) are added the Maxwell equations 


(1. 4) Fee ., == 0. 
These may be written also in terms of the four-potential : 
(1.5) — + Rupy’ = 0. 


As is customary in solving these equations, we shall assume y’;p = 0, so that 
the second term of (1.5) vanishes. 

(1.1) and (1.5) constitute a set of ten independent equations in fourteen 
unknowns. As in E. I., the addition of four non-tensorial “ codrdinate con- 
ditions ” gives the problem the correct degree of determinacy, and gives rise 
to equations of motion of singularities of the field (i. e., of guy and yv), which 
occur as conditions for the consistency of the set of equations consisting of 
field equations and codrdinate conditions. 

The existence of equations of motion follows as in E. I. If we define 


(1. 6) Spv = Tv — 


where muy is the metric of empty space-time, the equations (1. 4), (1.5) and 
(1.6) of the paper E. I. will be altered by the addition to their left-hand sides 


5 Infeld, Phys. Rev., vol. 53 (1938), p. 836. To be referred to as Be 
® Robertson, Ann. Math., vol. 39 (1938), p. 104. 


tic 


ion 


hat 


ne 


RELATIVISTIC EQUATIONS OF MOTION. 731 


of 1627«(Smn, Soo, Som) respectively, (m,n —=1,2,3). Therefore the equations 
of motion will become 


k 
(1.7) (2Amn 162xSmn) cos N)dS = 0, 


k 
(1. 8) f (2Aon + cos (z",N)dS =0, 


where the “i” above the integral sign indicates that the surface encloses the 
k-th singularity. Because the divergence of the integrand vanishes in each 
case, the integrals (1.7) and (1.8) do not depend on the shape of the surface 
of integration. They depend only on the codrdinates of the singularities and 
their time derivatives. 

The approximation method used is that of E. I. ($4). The expansion of 
the field quantities in terms of the parameter A makes it possible to solve the 
equations (1.1) and (1.5) along with the codrdinate conditions by the method 
of successive approximations. As in E. I. also, the components of the metric 
tensor are replaced as unknowns by the quantities yyy defined there, and the 


coordinate conditions chosen are 
(1. 9) Yon|n — Yoo|o = 90, 
(1. 10) Ymn|n = 0. 


In the first approximation these are invariant under a Galileian transformation. 
From the solutions of these equations we may calculate successive approxi- 
mations to the equations of motion. These equations may be written 


a0 k 
(1.11) m = 0, 
l=lo l 
ao k 
(1. 12) 0; 
l 
where 
k 


k 
== (2Amn 162KS mn ) cos (2", N)dS 

and 


k 
k & 
dn (C, — 10.0) (2Aon + cos (a", N) dS. 


k . 
The é* are the codrdinates of the k-th singularity, and dots represent derivatives 
with respect to the “ auxiliary ” time r= Az°. The numbers written beneath 
the various quantities indicate the power of A with which each is associated in 


the power series expansion in which it occurs. 
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The calculation of the gravitational potentials ypv is similar to that of 
K. I. H., these potentials being modified by the addition of terms arising from 
the inclusion of the quantities Suv in the field equations. The electromagnetic 
potentials yy are calculated as in I., being modified only by the addition of 
“interaction ” terms (i.e., terms involving products of mass and charge) 
arising from the entry of the gravitational potentials into the equations (1.5). 

As in Infeld and Wallace *, the possibility arises of choosing one of two 
possible forms of solution for the potentials. One proceeds in alternate powers 
of A only, and may be identified as the “standing wave” solution. This 
solution does not take account of the effect of radiation. The other, which 
proceeds in consecutive powers, is the solution generally known as the 
“retarded potential,” and does take account of radiation. From the formal 
point of view they are equally valid, though physical considerations give 
preference to the latter. 

The details of the calculations of the equations of motion may be found 
in T.; we shall be content here merely to state the results. It may be noted 
that the fourth equation of motion (1.8) imposes no restriction on the motion 
of the singularities in addition to those imposed by (1.7). 


k k 
The first non-zero terms in the expansions of Cm are Cm, (k = 1,2), 
4 
and these have the values 
(1.13) Cm == 4x | — on™ (2) + 


Cm 4x Mol — KM,Me arm (2) (2) 


where r = [ (* — *) (y* — £*) ]*; 7°, €* are the space codrdinates of the two 
particles, m,,m. are their masses and ¢, é. are their charges, respectively. 


k 
On equating the expressions for Cm to zero, we obtain a first approximation 
4 


to the equations of motion. Thus the motion in the lowest approximation is 
governed by an inverse square law, which is a combination of Newton’s law 
of gravitation and Coulomb’s electrostatic law for the interaction of charges. 

In calculating the subsequent approximations, these equations of motion 
may be used to simplify the integrals obtained. 


k 
The terms C’, are identically zero, so that the next approximation is that 
5 


k 
which involves the terms Cm. These may be split into three parts: 
6 


k k k k 
14) — Cn’ ot Cu” Cn’, 
5 6 6 6 


6 


fm 
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each of which bears to the one preceding a ratio of the order of magnitude of 
xm?/e*.. We may call the three terms the electromagnetic, interaction and 
gravitational terms respectively. The latter are given explicitly in E.I.H., 
equation (16.1); in the units of this paper the expression given there would 
be multiplied by x*. The electromagnetic and interaction parts are, for the 
first particle: 


(1. 15) n= 4x€1€2 [sz on” (2 ) On? 2) ) ane 
0 1 1 sem 
+ (4) On ("0 On” On’ On? 1) ive 


Ms acm 
and 


1 
(1. 16) Cm” — 4x*e,e2(4m, + 4mz) : 


respectively. (These are calculated in T.) 


Up to this approximation the results obtained are the same whether the 
standing wave or retarded potential solutions are used. It is when we come 
to calculate C, that the difference in the solutions makes itself shown. For if 


the standing wave solutions are used, Cm—0, but the retarded potential 


solutions yield 
1 


(1.17) Cm = — (8/3) Ke, + eot™), 


This term may be called the “ radiation term,” since it arises only when we 


take account of radiation in the calculation of the potentials. 


2. Solution of the equations of motion. The equations of motion of 


the first particle may be written 
MC m + APC = 0 
4 6 7 


to the approximation considered (see (1.11) ). If now we alter the units by 
going back to the original time coérdinate 2° instead of + and change the 
units of mass and charge from m and e to M and EF, where M = ’m and 
=e, we may write the equations in a form not involving A explicitly. 
It will not cause confusion if we keep the notation for time, mass and charge 
used throughout, and understand the equations to be written in terms of the 
new units. 
With this in mind, we may write the equations of motion 
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1 
myn” +- 6102 ()- = + ) 


1 
€ )+3 — + cm), 


M, 


and a similar equation in which the masses, charges and codrdinates of the 


particles are ow which may be designated as (2.1) (b). In these 
1 
equations, Cm’ and are neglected in comparison with Cin’. 


6 
We shall solve first the equations of motion without radiation, that is to 
say, the equations in which the terms of (2.1) (a) and (b) containing third 
derivatives are omitted. The effect of radiation will be studied in a separate 
section. 
Let us introduce the variables 


(2.2) (a) = + mee") + Me), 

% are then the codrdinates of the classical “center of mass” of the particles, 
and Bm their relative codrdinates. In terms of these variables the equations 


of motion may be written? 


é/1 
(a) Mim = 46102 — (2) + — 
1 #@ 
(1 1 


0 


0 m 


r 


or 
— te, C5 + m. ) On” On? 


where M =m, + mz. 


0 vn 
Since r= (Bs8s)” it follows that (3) . Therefore, as a 


first approximation to these equations we may write 


7 Cf. Darwin, Phil. Mag., vol. 39 (1920), p. 53 
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(b) MM, T° 


In the case where the charges e, and e. have opposite signs, these equations 
have the well-known solutions representing the elliptical orbit, 


Sm = = 0; = =F cos 6; = 2=fsin 6; 
Bs =0; = (1/p)[1 + cos (@—«)], 


where the bars indicate that the values given refer to the first approximation 


to the motion. The “angular momentum ” and “ energy ” integrals are 


(2. 6) = —=—— 
2M F 2a 


These equations enable us to obtain the relative velocity 7 and its radial 
component 7 as functions of u. 
The equations (2.2) may be solved for 7” and ¢” in terms of %m and Bm. 


In the first approximation, since &m = 0, 


(2.7) (m2/M)Bm, (m,/M) Bm, 


and these values may be used on the right-hand side of (2.3) (a) and (b). 
From (2.5) and (2.6) we deduce that 


(2. 8) 7? ——— (2au— 1—apu’). 
am,Me 


Furthermore 8.8, = — u/u® =7/u. From the foregoing facts and the equa- 


€ 


tions (2.3) (a) and (b) we deduce the equation 


(2. 9) Am — ABm + BBm; 
where 
. 
A=— —— (3apu?—4au+1); B=———— uw, and = m, — m.. 


M 


2aMm,m-, 


It will be noted that in the case of the circular orbit 1/u—p—a, u—0, 
so that %»,—0 and the center of gravity experiences no acceleration what- 
soever. This is also true in the case where the masses are equal, since then 
dm = 0. 

Let us investigate the motion of the center of mass. The secular per- 


turbation of the velocity @m is given by (1/T) { &%mdt, where 7 is the period 
0 


— 
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in the elliptical orbit. If this is written as an integral with respect to 6 and 
if we substitute from (2.9), it may be expressed in the form 


For each value of the index m it is easily shown that the sum of these integrals 
is zero. 

It may therefore be concluded that there is no secular perturbation of the 
center of mass. 

We turn now to a discussion of the relative motion. Making use of the 
equations (2.6) and (2.7) we obtain for Bm the equation 


Bm 
T° 


(2. 10) | = OBn + DBm, 
where 
= u® 3japu* + [1, — 1]2au— [1,1]) 
and 
Me,e. 
M,M> 


the square bracket being defined by 
M?[k, 1] = km,? —Ilm,mz2 + km.?. 


On the right-hand side Bm has been replaced by its first approximation Bm, the 
error introduced in this way being of the eighth order and therefore negligible. 
It is obvious that in this approximation, as in the first, 8, may be taken 
equal to zero, and the motion remains plane. 
We may introduce the quantities 


(2.11) A = (8:82 — 


The former is the classical angular momentum and the latter the classical 
energy of the system relative to the mass center. To the required order of 


approximation we may write 


da 

DL 
and 

dE 


+ Dv), 


‘at —M 
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where LZ is the angular momentum in the elliptical orbit. The first of these 
equations may be integrated to give 


(2.13) A=L(1+ [1,0]w). 


Also, substituting in the expression for d€/d¢ it is found that 


d€ 
3 + [3, —1]2au — [3,1])u, 


or, integrating, 
(2.14) #H{1——— u([0, 1]apu? + [3, —1]au— [3,1])}. 


Making use of the fact that 


M 2 | du 


where uw’ = 1/r in the perturbed orbit, we may deduce an equation in wu’: 


Me,e. 


2am,m> 


([1, 1]2aw + [1,1] — [0, 3]apu?)}. 


Since now wu is known in terms of @ the equation can be integrated to this 
order of approximation. The latter bracketed terms on the right give rise 
to the perturbation of the elliptical orbit, and are responsible for an advance 
of perihelion. 

Let us investigate the secular change in the longitude of perihelion. The 
right-hand side of (2.15) may be written as a sum of constants and multiples 
of cos (@—w) and cos 2(6—w). The first type of term involves only a con- 
stant change in p, and the last a periodic perturbation. Omitting these terms. 
we may solve and obtain 


2M, M2p 
The apsides are found by solving the equation 


dw’ Me, 
— =— — sin w) — 
p 2M,Mop" 


[sin (9@—w) (@—wo) cos | = 0.- 


One apse is at 0=w; the next is at 0—=w + 2x7 + 8, where do is given by 
the equation 


dw 


If this is applied to the problem of the hydrogenic atom, e; =—e, 
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= Ze, ~ 1/1850. Changing to natural units by replacing m by m/c? 


and e by e/c*, we obtain 
e* 


mpc? 


(2. 17) 


This is precisely the result obtained from other considerations by Sommerfeld.® 

The above solution applies to the case in which gravitational effects are 
negligible in comparison with electromagnetic ones. The converse limiting 
case is the one treated by Einstein, Infeld and Hoffmann.’ If the problem is 
attempted in its most general form, proceeding from the complete equations 
of motion involving electromagnetic, interaction and gravitational terms, no 
new fundamental difficulties are encountered. The results obtained are a com- 
bination of those derived in the two limiting problems. It is found once again 
that there is no secular perturbation of the mass center. The formula above 
for the advance of perihelion is modified by the addition of the term 
(6xxM )/pc?, which is the value obtained by Robertson ® in the gravitational 


problem. 


3. Effect of radiation on the orbit. The effect of using the retarded 
potential solutions for the gravitational and electromagnetic potentials is to 
give rise to the term 2¢,(e:y" + e.¢")/3 on the right-hand side of (2.1). 
This leads to an alteration on the right-hand side of (2.3) (a) of amount 
2(e€,; + es) (e:1n™ + e.f”) /3 and a corresponding change in (2.3) (b) of amount 
2(m2.e; — + Therefore the equation (2.9) for is 
replaced by 


(3.1) tim = (A + A’) Bm + (B+ B’) Bm, 


where A’ = 3gu*u and B’ = gu*, g being defined by 


A and B are as in (2.9). 
It is easily verified that 


~(A’Bm + B’Bm)Fd0 = 0, 


so that, taking account of radiation to this order, there is still no secular 


perturbation of the velocity of the center of mass. 


® Sommerfeld, Atomic Structure and Spectral Lines, Methuen (1934), pp. 253-4. 
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The revised equation for Bm is 


Me 1 C5 Bim 


T° 


(3.2 = (C+0’)Bn + (D+ D’)Bm; 


where C and D are defined as in (2.10), C’ = 3hu*u and D’ = hu’, h being 


9 2 
a contraction for the factor <:) €,é2. Calculating the time deriva- 
3 \m, Me 


tives of A and € and using (3.2) to simplify the result, 


dA 
— = DL, + hv L, 
(3. 3) 7 DL, + hw 
(3. 4) (CFF +- Dv?), — u® (3apu? — 4au + 2), 
( i 


where the part of an expression following the comma arises from the radiation 
terms. The perturbation of A and € arising from the terms preceding the 
commas are periodic and not secular. Hence, stable elliptical orbits are 
possible if radiation terms are omitted. 

Let us consider then the effects of radiation. Since / is negative when 
the charges have opposite signs, it follows that A decreases monotonically. 
Furthermore, the radiation term in (3.4) is responsible for a secular de- 
crease in €. The amount of this decrease is found, on calculation, to be 


) 
- a per revolution. It must be concluded that when radiation 
a 


lerms are taken into account, stable elliptical orbits are no longer possible. 
It can be shown that the effect of the last term in (3.3) is to cause a 
spiralling in of the orbit. For, omitting terms which merely introduce a periodic 
perturbation in A, we have 
dX 


and since u = (1/p)|1 + «cos (@—w)] we deduce that 


mM 
(3. 5) me Ny [0 — + sin 


0 Mp 
A, being the value of A given by (2.13). But A is proportional to the rate 
at which area is swept out by the line joining the particles. All the terms of 
m,Moh 
Mp 


decreases steadily with time. This indicates a secular decrease of area at a 


the right-hand side of (3.5) are periodic except (@—w), which 


correspondingly increasing rate. In units referred to which the velocity of 
light is c, this rate is |h(@—w)/2pce*| (the factor 1/c? may be checked by 


dimensional considerations). 
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In the case of circular orbits, the equations (3.3) and (3.4) become 


dA h 
6 
(3. 6) dt bid 
d€ he,es 
(3. ) “ade = = XxX: 


where o and x are positive constants. Hence both A and € decrease uniformly 


with time unless radiation terms are omitted, in which case they are constants 
of the motion. 

The accelerations of the two particles (electron and proton) are 
fr = €1€2/m,a* and f. = respectively in the circular orbits. There- 


fore, again using units in which the velocity of light is c, we may write 


(3. 8) — wt * 
dt 3 3 


The first two terms are in accord with the familiar expression for the radia- 
tion of accelerating electrons. The last term may be compared with the result 
obtained by Synge® for the rate of decrease of energy of a system consisting 
of several charges. He does not obtain terms corresponding to the first two of 
(3.8) because of the difference in his choice of energy-momentum tensor, 
which is defined in such a way as to involve only interactions between the 
various particles. 

We wish to acknowledge the encouragement and help of Professor Infeld, 


under whose direction the work appearing in this paper was instituted. 


UNIVERSITY OF CINCINNATI. 


® Synge, Trans. Roy. Soc. Oan., Series III, vol. 34 (1940). 
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POSTULATES FOR THE THEORY OF PROBABILITY.* * 


By ARTHUR H. COPELAND. 


In order to obtain a clear understanding of how a mathematical theory 
is to be applied, it is advisable that there be mathematical counterparts of the 
important activities in which one must engage in applying the theory. In the 
application of the theory of probability there are two important types of 
activity: the observation of successes and failures and the assignment of 
numbers (i. e., probabilities) which are appropriate to such observations. The 
system of postulates which is presented in this paper differs from other systems 
for the theory of probability in that it contains a mathematical counterpart 
of the observation. Without this our ideas concerning the relation between 
probabilities and observations are likely to remain hazy with regard to such 
questions as the following. What probabilities are appropriate to given obser- 
vations? To what extent are these probabilities arbitrary and to what extent 
are they determined? In what sense can a probability be verified or fail to be 
verified by observations ? ° 

The postulate system of this paper is based on Boolean algebra. The 
existence of a mathematical counterpart of the observation is made possible by 
the following devices. First we introduce a subset of the Boolean elements. 
The elements of this subset are called atomic since they admit of no decom- 
position within the system. Second, in addition to the operators -, v, ~ (and, 
or, not) of classical Boolean algebra we shall introduce a new operator C (if). 

The analogies between the operators +, X, — of ordinary algebra and 
the operators v,-,-—~ (or, and, and not) of Boolean algebra have often been 
noted. The operator — has no analogue in the classical Boolean algebra. 
This analogue is now furnished by the operator C. Thus the operation 
z-+y is defined if and only if the denominator y is not equal to zero. Simi- 
larly «Cy is defined if and only if y is not equal to zero. Corresponding to 
the relation 


+ = 9) +4 


* Received February 17, 1941. 

1 Presented to the Society April, 1937. 

?For a more complete treatment of the philosophical aspects of the theory of 
probability the reader is referred to Copeland I. References to the literature are given 
at the end of this article. For other postulational bases for the theory of probability 
see Koopman, I, II, III, and Evans and Kleene, I. 
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we have 
2) v (yCz) = Cz. 


The analogy to the relation 
(cy) X (2+u) = Xz) + (y Xu) 


is not so complete. In Boolean algebra two fractions are multiplied in this 
simple manner only when they have a common denominator. Thus 


(eC z)-(yCz) = (r-y) C = (2-y) Cz. 
Corresponding to the relation 


= + (y X2), 
we have 
tCy=(x-y) C = (x-y) Cy. 


Unfortunately the analogy fails at the most important point. The operator C 
does not furnish us with an inverse to multiplication. Thus y: (2 C y) is not 


in general equal to z. However we do have the equation 
P(y) X Cy) = p(z) 


whenever x-y =z. In this equation p(y), p(a Cy), and p(x) are respec- 
tively the probabilities of y,  C y, and a. 

The operation 2 C y is similar to the operation y 2 (y implies 2) 
defined in Principia Mathematica.* However the definition of implication 
given in Princija is not sufficiently general to enable one to handle conditional 
probabilities or inverse probabilities. The operation «Cy defined by our 
system is adequate for such probabilities. In addition it is equivalent to yz 
in all cases where implication is required in symbolic logic. 

It is significant that although Boolean algebra is an important contribu- 
tion to mathematics, it has been applied only to a very limited extent in that 
field for which it was intended—namely, the theory of probability. Perhaps 
the reason for this neglect is: that the symbolism is inadequate to handle 
conditional and inverse probabilities. Now that this deficiency is remedied we 
obtain an extremely simple and convenient symbolism for the treatment of all 
probability problems. 

Before presenting the postulates I shall make a few remarks concerning 
notation. In order to avoid a multiplicity of parentheses, brackets, and braces 
I shall follow the conventions laid down in Principia as to the use of dots for 


bracketing. When a single dot appears in a formula, it is understood that 


® Whitehead and Russell I. 


‘ 
I 
‘ 
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everything to the right of it is to be treated as though it were enclosed in one 
bracket and everything to the left of it is to be treated as though it were 
enclosed in another bracket. Thus zvy-C z means (xv y)(C z) or simply 
(cvy) Cz When further bracketing is required, two dots are used to in- 
dicate a stronger bracketing than one, etc. Thus rvy-Cz:Cw means 
[(cxvy) Cz] Cu. When a dot is used to mean “ and,” it also has a bracket- 
ing force. Hence (x-y) Cy is written r-y: Cy. The double dot is also 
used to mean “and” when a stronger bracketing force is needed. Thus 
a*~ (yvz) is written r:~-yvz. We shall depart from the convention of 
Principia to the extent that the equality sign will always bracket. 


1. The postulates. We shall be concerned with a set B (Boolean ele- 
ments), a sub-set A (atoms), and a set N (real numbers). The elements of B 
can be combined by means of the operators -, v, ~, C (and, or, not, if) to 
form new elements of B. An additional operator p (probability of) trarisforms 
an element of B into an element of N. The expression p(x) means the 
probability of 2. We shall now present the postulates. Each one is followed 


by an interpretation. 
PostuLaTte 1. The set B constitutes a Boolean algebra. 


We shall let UV denote the unit element and 0 the zero element of the 
Boolean system. This first postulate is really a number of postulates. See 


Huntington I and Bernstein I. 


PostuLATE 2. If x is an element of B such that x-y==2 or 0 for every 


element y of B, then x is an element of A. 


PostuLate 3. If x is any element of A and y is any element of B, then 
0. 

The atoms are those elements which admit of no decomposition within 
the system. Hence an atom lies either entirely within or entirely outside of 
any other element of the system. 

PostuLaTE 4. Jf x and y are two elements of B such that r-z=y-z 
for every element z of A, then x y. 

Two elements x and y are equal if they are composed of the same atoms. 

Postutate 5. If x2 and y are elements of B and y 0, then xC y is an 

element of B. 


If z and y are events, then “z if y” is an event. 


744. ARTHUR H. COPELAND. 


PosTULATE 6. if y0. 
The event “a if y” has the same meaning as the event “z and y if y.” 
POSTULATE 7%. p(x) is an element of N. 


The probability of an event is a number. However the expression p(z) 
may be given an alternative interpretation. In fact it is proved (see Theorem 
3) that if z is an atom, then p(yC~) is either 1 or 0. The expression 
p(y Cx) is then interpreted as an observation. The value 1 designates a 
success ; the value 0, a failure. It is further proved (see Theorem 7) that if 
x is a disjunction of n atoms then p(y is the success 
ratio corresponding to the observations p(y C21), p(y p(y Can). 
All of the above interpretations require that when the operator p is applied to 
a Boolean element, it produces a number. It will be observed that this postu- 
late also consists of a number of postulates since we are demanding that the 
set of elements p(x) possess the properties of the numerical continuum. See 
Huntington II. 


PosTuLaTE 8. If z 1s an element of B and x and y are elements of A 
such that z=20 and y-z=y 0, then p(x Cz) = p(y Cz). 


Roughly this postulate states that all atoms are on a par with respect to 
probabilities. That is, if x and y are two non-zero atoms each of which is 
included within an element z, then the probability of 2 on the assumption 
that z has occurred is the same as the probability of y on the assumption that 
z has occurred. However the important application of this postulate is in the 
development of Theorem 7 where z consists of a finite disjunction of atoms 
and hence p(z C z) and p(y C z) are interpreted as success ratios instead of 
probabilities. Therefore it would be preferable to say that atoms are on a par 


with respect to success ratios. 
PosTuLaTE 9. p(U Cz) —1 if 
PostuLaTE 10. p(0Czv) =0 if 


These postulates state that the probability of a certainty on the as- 
sumption that an event z has occurred is 1 and that the probability of an 
impossibility on the assumption that an event x has occurred is 0. 


PosTULATE 11. 
+ p(yCz) — p(x: y: Cz) if 


This is the additive law of probability. It can be put into more familiar 
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form by specializing. First it follows from Postulate 10 that if the events x 
and y are mutually exclusive (i.e., x: y= 0), we obtain the equation 


=p(aCz) + p(yC2z). 


Next it can be proved (see Theorem 10) that if we then substitute U for z, 
we obtain p(xvy) = p(x) + p(y). This is the conventional form of the 


additive law. 


PosTULATE 12. The set A 1s an w-series and if 0 1s a member of A, then 
0 is the first element of the series. 


DEFINITION. An element x of B 1s called a fundamental segment with 
respect to an element n of A provided 

(a) m-x=—m for every element m of A which either precedes or ts 
equal to n. 


(b) m-:-x2=0 for every element m of A which follows n. The funda- 
mental segment x of the element n is denoted by (n). 


PosTULATE 13. Given a positive number « there exists an element N of 
A such that 
| ple (n)] —p(2)| <« 


for every element n of A which follows N. 


Without some such demands as those contained in Postulates 12 and 13 
we should be unable to obtain any relation between observations and proba- 
bilities. In fact it would in general be true that we could obtain for a given 
event any one of the success ratios 0,1/n,2/n,---+,1 by a proper choice of n 
observations. 

Postulate 12 demands that the set of atoms shall have the order type of 
the non-negative integers. Hence it is convenient to use the integers them- 
selves to denote these elements. It is proved (see Theorem 5) that the element 
0 actually is atomic and hence must be the first element. Thus the atoms are 
denoted as follows: °°. 

A fundamental segment (n) can be interpreted as the disjunction of all 
atoms up to and including n. See Theorem 8. The expression p[z C (n) ] 
is therefore interpreted as the success ratio for the first n observations of the 
event 2. Postulate 13 demands that the limit of this success ratio shall be 
equal to the probability of the event z. Thus Postulates 12 and 13 give us a 
relationship between observations and probabilities. They demand that this 
relationship shall exist only for a specified manner of choosing the observa- 
tions. This requirement does not prevent us from forming success ratios 


5 
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corresponding to sub-sequences of these observations. In fact the reader is 
referred to Definition 18 for the consideration of such sub-sequences. The 
extent to which Postulate 12 restricts the Boolean system will be discussed 
after Definition 12. 

Postulate 12 together with some of the preceding postulates demands that 
the elements possess the following structure. If x is any element of B, then 
p(xC 1), p(x C2),- - is a 1,0-sequence and represents the sequence of 
observations of z. This sequence is called the spectrum of z. A spectrum 
uniquely characterizes an element. That is, two Boolean elements having the 


same spectrum are equal. See Theorem 9. 


PostuLaTE 14. Jf x and y are elements of B, m and n are non-zero ele- 
ments of A, p[(m) C (n)] = ply C (n)], and m does not follow n, then 
tCry-(n) =2Cy-C (m). 

PosTuLaTE 15. If 2, y, z are elements of B, m and dX are elements of A, 
y =z, (m)-y=y, p[(A) (m)] = p[yC (m)] 
and X does not follow m, then zc n=2zC-n-+A for every element n of A 


with the exception of 0. 


These postulates prescribe the following relation between the spectra: 


of the elements z, y, and za C y. See Theorem 19. From the spectrum of 
y let us select in order those terms which are equal to 1 and let us denote 
them by y(™), y("),- Then where k = 2,° 
It will be observed that the spectrum of «Cy is obtained by selecting a 
sub-sequence of the terms of the spectrum of x. The selection operator is y. 
A term of the spectrum of z is selected if and only if the corresponding term 
of the spectrum of y is a 1, i.e., a success. Hence xC y appropriately 
represents the event “z if y.” 

We shall now consider more specifically what is demanded by Postulate 
14. If p[(m) C (n)] = p[yC (n)] and if m does not follow n, then m is 
the number of 1’s in the first n terms of the spectrum of y. The spectra of y 
and y- (n) agree in the first n terms and hence the spectra of xC y and 
xz C@:y- (mn) agree in the first m terms. Moreover the first m terms of the 
spectrum of «Cy are the same as the first m terms of the spectrum of 
z@Cy-C (m). Therefore we have the equality of the first m terms of the 
spectra of C:y:(n) and«Cy-C (m). We shall see that it is a consequence 
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of Postulate 15 that for each of these sequences the remainder of the terms is 
a periodic repetition of the first m terms. Thus xC:y:(n) =xCy-C (m). 

Postulate 15 is concerned with an element y whose spectrum contains 
only a finite number of successes. Such an element is characterized by the 
fact that there exists an m such that (m)-y=—y. The conditions that A does 
not follow m and that p[(A) C (m)] = p[y C (m)] then tell us that A is 
the total number of 1’s in the spectrum of y. The equation z Cn=zC-n-+A 
(where z=2Cy) states that the spectrum of z is periodic and has the 


period A. 


2. The development of the theory of probability. In this section we 
shall develop a sufficient number of theorems to assure the reader that the 
theory of probability can be derived from these postulates. Each theorem will 
be followed by an indication in parenthesis of the postulates and theorems 
upon which it is based. , 


We observe that p(rxC = zx) =p(U Cz) =1. Hence 
THEOREM 1. p(xC x) =1 if x is an element of B and r0. 
(P:1, 5,6, 7, 9) 
If = 0, then 
= Cy) + p(tz2 Cy) —p(9Cy) 
p(t Cy) + Cy). 


Hence we obtain inductively 


= C y) + y) -+ p(anC y) 
if * are mutually exclusive elements of B (i.e, 2° 2%; =0 


whenever i= j) and if y is an element of B such that y 0. 
(FP: 1,5, 7, 


If z is an element of A and y is an element of B, then 
zr) or p(0Cz). 
But p(x Cz) =1 and p(0C az) =0. Hence 


THEOREM 3. If x is an element of ‘A distinct from the zero element and 


y is an element of B, then p(yC x) =1 or 0 according as x-y =~ or 0. 
7,98. 


The expression p(y Cx), where « is an element of ‘A and y is an element 
of B, is interpreted as an observation. We say that a success has been observed 
when this expression is equal to 1 and a failure when it is equal to 0. An 
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alternative interpretation is the following. We may regard z as a point and y 
as a set of points. The expression p(y C 2) is then the characteristic func- 
tion of point sets. It has the value 1 when the point z belongs to the set y 
and the value 0 otherwise. 

If x and y are elements of A such that x-y 0, then x-y=vz. Also 
z-y=y:'x=y. Hence r—y. Therefore 


THEOREM 4. If xz and y are elements of A and ry, then x: y=0, 


The element 0 has the property that if y is any element of B, then 
0-y=0. Hence 

THEOREM 5. 0 is an element of A. (?':3,%) 

If 2, 2%2,° are distinct elements of A, then 


But = 2 and hence 


Therefore 


THEOREM 6. If 2, 22,°**,@n are distinct non-zero elements of A, then 


(P:1,8. 7:1,8,4 


If 21, 2%2,° * °,%n are distinct elements of A and y is an element of B, 
then y: 2; =; or 0. In either case 


Hence it follows from Theorem 2 that 


THEOREM 7. If 2, are distinct non-zero elements of A and 


y is an element of B, then 


(P:1,8, 6,8, 7,10. 7:3,3,4,6) 
Let 0,1,2,°:-, be the first n + 1 elements of A and let r=1v2v---vn. 


Then z-m=—=m if m does not follow n and z:m=O if m does follow n. 
Therefore 
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TueEorEM 8. If 0,1,2,---+,n are the first n+ 1 elements of A, then 
(n) (P:1,4,12. T:4) 

If x and y are elements of B and n is an element of A, then p(aCn) 
is equal to 1 or 0 according as x- n is equal to n or 0. Similarly for y. Hence 
p(xCn)=p(yCn) ifa-n—=y-n. Thereforex—y if p(xCn) =p(yCn) 
for every element » of A other than 0. With the aid of Theorem 7 it can be 
proved inductively that « = y if p[a C (n)] = p[yC (n)] for every n of ‘A 
other than 0. Thus 


THEOREM 9. Jf x and y are elements of B having the same spectrum, 
then ay. Moreover if p[xC (n)] =—pl[yC (n)] for every n of A other 
than 0, then x = y. 


We observe that p[(n) C (n)] =—p[UC (n)]. Thus it follows from 
Postulate 14 that 


aC (n) =2C:U-(n) U-C (n). 
Hence 
p[zC (n)] (n)]. 


As a consequence of Theorem 9 we obtain 
The following four theorems are now obtained by simple substitution. 


THEOREM 11. p(xvy) =p(xv) + p(y) 
1; 


THEOREM 12. p(U) =1 and p(0) =0. 


Note that p(2) =1 does not imply x =U and p(x) =0 does not imply 
r= 0. 


THEOREM 13. If 2, %2,°**,% are mutually exclusive elements of B, then 
In) = + p(t2) + pla). (P21. T: 2,10) 


THEOREM 14. p(a@-~y) = p(x) — p(x-y) and p(~z) = 1— p(2). 


It follows from Theorems 7 and 8 that 0S p[aC (n)] 1. Hence by 
Postulate 13 we obtain 


THEOREM 15. 0S S1. 2 


If x and y are elements of B, then x: (n) =2,v22.v: + -V¥%m where 
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©, L2,°**, Xm are those non-zero atoms (taken in order) such that 2-7 = 2. 
Then p[zC (n)] = p[(m) C (n)] = m/n and, if x ~0, we can choose n 
so large that m0. It follows from Theorem 7 and Postulate 14 that 


(a) (n)] = p(yC /m—= ply C (m)]. 
On the other hand ’ 
plz-y:C (m)] p(y Ca) /n. 


Since p(y: Ck) =0 if is not equal to some 2; and 
p(y:z:Ck) =p(yCa) if 
Comparing (a) and (b) we see that 
(c) (n)] =plyCa-C (m)]-m/n 
= ply Ca-C (m)]- plz (n)]. 


From this point on we shall consider separately two cases. In the first case m 
increases without bound as nm becomes infinite and in the second case m 


remains finite. In the first case we obtain 
lim p[z:y: C (n)] =lim (n)]-lim p[y C z-C (m)] 


and hence p(x: y) = p(x): p(y Co). In the second case p(x) and p(y: =z) 
are both zero and hence irrespective of the value of p(yC zx) we have 
p(y:x) Therefore 


THEOREM 16. p(z-y) if 
(P: 1, 3, 4, 5, 6, 7, 12, 13, 14. 7: 3, 7, 8) 


By interchanging z and y in Theorem 16 we obtain 
= p(y) y) = p(z) 2). 


We can divide both sides of this latter equation by p(y) provided p(y) +0. 


Hence 


17, p(eCy) —2@) and ply) £0. 


P(y) 
(P23. TF: 36) 


Theorem 17 is a form of Bayes’ Theorem. To obtain a more usable form 
of this theorem we shall let y be an element of B such that p(y) ~0 and 
21, %2,°**,%n be a set of mutually exclusive non-zero elements of B. We shall 
suppose that y succeeds only if one of the events 2, %2,° - +, %n succeeds, i. e., 
2%, From this equation it follows that 
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and hence 
But by Theorem 16 
te) = p( ax) p(y © ax) 


and hence 
P(y) p(t) p(y %). 
Substituting this value of p(y) in the equation of Theorem 17 and replacing 
by we obtain 


THEOREM 18. Jf y is an element of B such that p(y) ~0 and 


are mutually exclusive non-zero elements of B and 
then 
Cy) = (P:1. T:13, 16, 17) 


P(t) ‘p(y © 
Let 2,, %2,--- be the non-zero elements of A (taken in order) such that 
p[(m) © (n)] = (n)]. 
It follows from Theorem. 7 and Postulate 14 that 


m 


C:a- (n)] => ax) /m 
k=1 
= ply (m)] p(y C2-Ck)/m. 
R=1 


Since the two summations are equal for all values of n and hence for all values 


of m, it follows that 
p(y C ax) = p(yC2x-Chk) 


and therefore yC x-k =k if and only if y-7,—2,. Thus 


THEOREM 19. Jf x its an element of B such that r540 and x, x,: - 
are the non-zero elements of A (taken in order) such that x- 2; —=2,, then 
yCu-k=k uf and only if Y Uy = Lp. 3, 12, 14, 7:3; 8) 


From Theorem 19 it follows that ~y-Cav:k=k if and only if 
x, But this latter equation holds if and only if y- and 
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hence if and only if yC #-k =0 and in turn if and only if ~-yCa:k=k, 
Therefore by Postulate 4 


THEOREM 20. (P:1,3,4. 7:19) 
In a like manner the following two theorems are easily proved: 


THEOREM 21. tf 250. 
3, 3,4, 39) 


THEOREM 22. if 250. 
(P:1,3,4. 7:19) 


Let and y be two elements of B such that and yC Then 
=k for all values of & and hence by Theorem 19 y- a = 2; for all 
values of k. Therefore and consequently 


Conversely if yy ~ a = U, we can multiply both sides of this equation by z 
and obtain 
Hence 
Cromer 
Therefore 
THEOREM 23. If x0, thenyCx=U if and only if yy~rx =U. 
(P:1,3,4,6. 7:19) 
Theorem 23 shows the relation between the symbol C as defined by this 
postulate system and the Whitehead-Russell definition of implication. The 
equivalence breaks down when z= 0. That is, in our system a contradiction 
does not imply all propositions. In all other cases where implication is applied 
in symbolic logic it is equivalent to the operation yC z defined in our system. 
Tt should be noted that the demand that a contradiction imply all propositions 


is not necessary to symbolic logic. 


3. Certain sub-sets of Boolean elements. The following definitions 
will enable us to gain further insight into the nature of the system which we 
have developed. 

DEFINITION 1. Let F consist of all finite disjunctions of elements of A 
and the negatives of such disjunctions. Then the elements of F are called finite 


elements. 


DEFINITION 2. An element x belongs to P if and only if there exists a 


POSTULATES FOR THE THEORY OF PROBABILITY. 753 


non-zero atomic element X such that x Cn=2C-n-+A for every non-zero 
atom n. The elements of P are called periodic elements. 


DEFINITION 3. An element u belongs to R if and only if there exist two 
elements x and z of F and an element y of P such that u=a-y:vz. The 
elements of R are called rational elements. 


DEFINITION 4. ‘An element x belongs to C if and only if there exist 
atoms r and n such that x-s =s or 0 according as s is or is not of the form 
r+i1+ikn where s is an element of A,n >0 and k =0,1,2,---. The ele- 
ment «x thus defined will be denoted by (r,n). The elements of C will be 
called fundamental rational elements. 


The elements of these various sets can most easily be described by means 
of their spectra. The spectrum of a finite element consists either of a finite 
number of 1’s and an infinite number of 0’s or of a finite number of .0’s and 
an infinite number of 1’s. The spectrum of a periodic element is periodic. 
The spectrum of a rational element may be irregular up to a certain point 
but from that point on it will be periodic. Thus the spectrum of a rational 
element is the sequence of digits of the binary development of some rational 
fraction. Conversely corresponding to any given rational fraction there is a 
rational element whose spectrum is the binary development of this fraction. 
The elements of C are special rational elements. The spectrum of the element 
(r,n) consists of r 0’s followed by a 1, then followed by n —1 0’s next a 1, 
then n — 1 0’s, next a 1, ete. 

It follows from Postulate 15 that if x is any element of B, then + C (n) 
is periodic with period n. The first n terms of the spectra of 2 and x C (n) 
are the same. These n terms are repeated periodically in the spectrum of 
aC (n). Next let us specialize the element z Let 0O=r<n. Then it is 
easily seen that r+ 1-C (n) = (r,n). We shall next investigate how the 
element (r,”) can be constructed when n =r. We shall consider first the 
element (7,1). In the spectrum of this element the first « terms are 0’s, the 
(r+ 1)-st term is a 1, and every term thereafter is a 1. Hence (r,1)—~(r). 
In particular (0,1) =~ (0) —=~0=U. It is now easily seen that if 
0Sr<n, then r+1-C (n): (kn, 1) = (kn+7,n). Hence any funda- 
mental rational element (7,) can be constructed by means of the operations 
permitted within the system. An arbitrary rational element can be constructed 
by means of a finite disjunction of fundamental rational elements and atomic 
elements. Thus suppose we have the binary development of an arbitrary 
rational fraction. We can start with the atoms and by means of the operations 
permitted within the system construct a rational element whose spectrum is 
the given binary development. 


or 
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The atoms and the rational elements may be considered as the analogues 
of the integers and the rational numbers respectively. Any finite combination 
of Boolean elements formed by means of the operators -, v, ~, C in the 
manner permitted by the postulates is called a rational function of those ele- 
ments. An arbitrary rational element can be constructed by forming the 
proper rational function of the proper atoms. Moreover any rational function 
of rational elements is a rational element. We have already observed that 
rational elements have ultimately periodic spectra analogous to the develop- 
ments of rational numbers. The probability of a rational element is equal to 
the total number of successes in a period divided by the length of the period. 
For example if x is a periodic element with period A, then p(x) = p[xC (A)]. 
The initial irregularity of a rational element does not effect its probability. 
Thus the probability of every rational element is rational. The following ten 
theorems are now easily proved. 


THEOREM 24. R includes all the elements of F, P, and C. 


THEOREM 25. If OSr<n, then (r,n) =r+1-C(n) and (r,n) 
ts an element of P. 


THEOREM 26. (7,1) =~ (r) and is an element of F. 

THEOREM 27. (0,1) =U. 

THEOREM 28. Jf OSr<n, then (kn+r,n) =r+1-C (n): (kn, 1) 
where k =0,1,2,°--. 

THEOREM 29. If x 1s an element of F, then zx is a finite disjunction of 
elements of A and C. 


THEOREM 30. Jf x ts an element of R, then x ts a finite disjunction of 
elements of A and C. 


THEOREM 31. If x is a non-zero element of P, then zx is a finite dis- 
junction of elements of C. 


THEOREM 32. F, P, and R each satisfy Postulate 1 and hence constitute 
Boolean algebras. 


THEOREM 33. Postulates 1-15 will be satisfied tf B is replaced by R. 


In previous papers (see Copeland I) the author has shown how the 
operations ~, -, v, C can be interpreted as operations on spectra. The spectra 
can in turn be characterized by numbers. Thus in the light of Theorem 33 
it is easy to construct an interpretation in which the elements are rational 
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numbers. This interpretation will satisfy all the postulates and hence the 
system is consistent. 


4, Relation to other systems. The postulate system of this paper is 
obviously not categorical. This lack of completeness permits a certain flexi- 
bility which enables us to express other systems on the base of our postulates. 
However in order to do this it is necessary to include further definitions. We 
shall consider first the system of Kolmogoroff which is based on the concepts 
of field and distribution function. To this end we shall introduce the following 
definitions : 


DEFINITION 5. A system B’ is called a field* tf tt satisfies Postulate 1. 

DEFINITION 6. A field B’ (with corresponding sub-set A’) 1s atomic tf 
B’ and A’ satisfy Postulates 2, 3, and 4. 

DEFINITION 7. A field B’ is a sub-field of B if B’ is a sub-set of B. 

DEFINITION 8. An atomic sub-field B’ of B is complete provided the 


following condition is satisfied: if x and y are any two elements of B such. 
that x-2=y-2z for every element z of A’, then x = y. 


Let B’ be a complete atomic sub-field of B and let x be any element of A. 
Then if y is any element of A’, it will also be an element of B and hence 
r‘y=2zor 0. But if e-y=0 for all elements y of A’, then it follows from 
Definition 8 that c= 0. Thus if x0, there must exist an element y of A’ 
such that x-y =z. If on the other hand z= 0, then x-y for every y 
of A’. Hence 


THEOREM 34. If B’ is a complete atomic sub-field of B and if x is any 
element of A, there exists an element y of A* such that rx-y =a. 


We shall now introduce the concept of distribution function.® 


DEFINITION 9. A function x ts a distribution function of a field B’ if 

(a) x(x) exists and is greater than or equal to 0 if x ts any element 
of B’. 

(b) w(xvy) +27(y) for any two elements x and y 
of B’. 

(c) w(U) =1 and r(0) —0. 


It will be observed that the set B itself is a field and that the function p 


“See Kolmogoroff I and Hausdorff I. 
5 See Kolmogoroff I. 


= 
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is a distribution function of this field. Thus B and p satisfy Kolmogoroff’s 
postulates. In Kolmogoroff’s system conditional probabilities are provided for 
by the introduction of new fields whereas in our system y C z is an element 
of B if x and y are elements of B and x0. However the field B is somewhat 
restricted by Postulate 12. This restriction is to a certain extent alleviated in 
the application of the system by a device which permits us to establish a corre- 
spondence between a sub-field B’” of B and a more general field B’. To this 
end we introduce the following definitions: 


DEFINITION 10. The following conditions are imposed on a correspond- 
ence x —> y between the elements x of a field B” (with distribution function p) 
and the elements y of a field B’ (with distribution function +). When such a 
correspondence exists, B’ is said to correspond to B’ and this fact is indicated 
by the symbol B’ > B”. 

(a) If x is an element of B’, there exists an element y of B’ such that 

(b) If y is an element of B”, there exists an element x of B’ such that 
y. 

(c) If and then y=z. 

(d) If r-y, then ~t>~y. 

(e) If and yr, then 

(f) If then x(x) = p(y). 

DEFINITION 11. The correspondence B’ > B” is said to be atomic if B’ 
and B” are atomic and if 

(g) For every element y of A” there exists an element x of A’ such that 

Let x and y be elements of B’ and wu and v be elements of B”’ (where 
B’ B”) and let uandy—v. Then 


Next let x and y be elements of B’ and let z be an element of B” such that 
z—zandy—z. Then 

Hence 


= p(z-~z) =0. 


We now have the following two theorems: 


Vv. 


it 
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THEOREM 35. If c—u and y—>v where the correspondences satisfy 


the conditions of Definition 10, then rvy—uvv. 


THEOREM 36. If «—>z and y—>z where the correspondences satisfy 
the conditions of Definition 10, then ~y:viy:~a2) =0. 


The correspondence defined in Definition 10 is not one to one. However 
an element of B’ uniquely determines the corresponding element of B”. 
Furthermore if two elements of B’ correspond to the same element of B”, 
it follows from Theorem 36 that these two elements differ by an element 
whose probability is 0. Thus the correspondence preserves all probability 
relations and at the same time permits B’ to be a much more general field 
than B”. In order for the theory to include the mathematical counterpart of 
the observation it is necessary for the correspondence to be atomic. When this 
is the case, we can establish a correspondence between the elements of A and 
the atomic elements of B’ in the following manner: 


DEFINITION 12. Jf x is an element of A, y an element of A’, z an ele- 
ment of A”, y—>z, and x-z=2, where B” is a complete atomic sub-field of 
B and the correspondence B’ > B” is atomic; then r—> y. 


Let x be an element of A other than 0. Then there exists an element z 
of A” such that «-z—z. The element z satisfying this equation is unique 
since the elements of A” are mutually exclusive. It follows from Definition 11 
that there is an element y of A’ such that y—z, i.e, ry. Next let us 
suppose that there are two elements y; and y, of A’ such that y,-—>z and 
y2—z. Then since the elements of A’ are mutually exclusive, 


But > ~z=0. Hence by Definition 10, Since this is a 
contradiction, the correspondence 2 — y uniquely determines y. We now have 


the following theorems: 


THEOREM 37. For every element x of A there exists an element y of A’ 
such that x—y, where the sets A and A’ and the correspondence x— y are 
defined as in Definition 12. 


THEOREM 38. If c> 41, T—> Yo, and x0 where the correspondences 
are defined as in Definition 12; then y; = Y2. 


Let u, v, x, y, 2 be respectively elements of B’, B’, A, A’, A” such that 
u>v, y>z, and e~0. Then u-y—uv-z and Hence 
if and only ifu-y=—y. Thus 


ff’s 
for 
nt 
lat 
in 
e- 
lis 
d- 
a 
d 
ut 


ARTHUR H. COPELAND. 


THEOREM 39. If u, v, x, y are respectively elements of B’, B’, A, A’ 
such that u->v, y, and ~0 where B’, A, A’ are related as in 
Definition 12, u—> v is defined as in Definition 10, and x > y is defined as in 
Definition 12; then v-x =z if and only if u-y=y. 


The set A’ can be interpreted as a space and the elements of A’ as points 
of this space. The set B’ is then a field of the space A’ and the elements of B’ 
are sets of points of the space. Definition 12 assigns to each non-zero element 
of A a unique label (Merkmal®), i.e., a unique point of the space A’. The 
inverse correspondence is not unique. In fact to a point of A’ there may corre- 
spond an infinite number of elements of A. Moreover there may be points of 
A’ which correspond to no elements of A. Definition 12 merely demands that 
there be a sub-set K of A’ such that to each element of A there corresponds a 
unique element of K and to each element of K there correspond one or more 
elements of A. Since A possesses a serial order, the correspondence determines 
a unique sequential arrangement of the elements of K. In the sequence K the 
same element may be repeated a finite or an infinite number of times. 

Let wu be an element of B’ and v be an element of BY” such that u—>v. 
Of the first n terms of the sequence K: let m be the number which are elements 
of the set u. It then follows from Theorems 7 and 39 that p[v C (n)] = m/n. 
Then m/n is the relative frequency with which an element of w occurs in the 
first n terms of K. We have lim m/n = p(v) =7(u). Hence the sequence 


K possesses the distribution (Verteilung) (u) defined over the field B’. 
The sequence K can be interpreted as a spectrum of a much more general 
nature than hitherto considered. The terms of K are the observations. These 
observations may be numbers, points, successes, failures, colors, or anything 
we please. 

The set B’ is still somewhat less general than the type of field permitted 
by Kolmogoroff. It is nevertheless sufficiently general for the theory of proba- 
bility. The extent to which B’ is restricted will be considered in detail in a 
subsequent paper. It will be sufficient here to note the results. We can start 
with an arbitrary denumerable field and extend it in the following manner. 
An element z will belong to the extended field provided that for every positive 
number e there exist two elements y and z of the denumerable field such that 
and r(z°-~y) <e. 

We have observed that K possesses one of the properties of a collective 
(Kollektiv *®). Namely it has a distribution. The remaining properties of the 
collective are expressed in terms of the operation of selection (Auswahl’). 


6 See v. Mises I. 
7 Mises I. 
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We have seen that in the expreession Ca we may regard a as a selection 
operating on the element x. In order to study the operation of selection we 
shall introduce the following definitions: 

DEFINITION 13. If xCa-Cb=—2xCe for every z of B, thenaX b=c 
and conversely. 

DEFINITION 14. A set S of elements of B is called fundamental * if and 
only af 

(a) S contains U. 


(b) Jf aand b are elements of S, then a X b is an element of S. 


The product a X b can most easily be studied by considering the corre- 


sponding spectra. In the spectrum of A let m,, m2,--- be the superscripts of 
the terms which are equal to 1 and let n,,2,--- be defined correspondingly 
for b. Then mn,, mMn,,*** indicate the positions of the 1’s in the spectrum of 
aX. It will be noted that if the sequence m,, mz2,--- is finite and the 
sequence 7;,%2,°** is infinite, a x b is not defined. We shall, however, be 
interested in the cases where both sequences are infinite. In this case the 
product a X b always exists and is unique. Next if »,,v2,- - - indicate the 


positions of the 1’s in the spectrum of c, then in the spectra of both 
(aX 6) Xc and aX (bX c), mn, Mny,»* are the superscripts of the 
terms which are equal to 1. Hence (aX b) Xc=aX (bX c). All the 
terms of the spectrum of U are equal to 1 and therefore a K U =U Ka=a. 
In the expression z C:a-b the terms of x which are selected by the operator 
a:b are those which correspond to 1’s in the spectra of both a and b. This 
selection can be performed in an alternative manner. We first make the 
selections x C b and aC b. If we now select from the spectrum of «C b 
those terms which correspond to 1’s in the spectrum of aC b, we thereby 
select from x those terms which correspond to 1’s in the spectra of both a and b. 
Hence rC:a-b=2rCb-C-aCb. It follows from Definition 13 that 
a-b=bX-aCb~—aX-bCa. By considering the spectra of the elements 
(r,n) and (p,v) of the set C (see Definition 4) it is easily proved that 
(r,n) X (p,v) = (r+ pn, nv). We have already noted that (0,1) =U. 
In the following theorems each of the elements a, b, c is such that its spectrum 
contains an infinite number of 1’s. This implies that these elements are 


different from 0. 


THEOREM 40. (a Xb) Xc=aX (bXc). 


Dorge I. 
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THEOREM 41. aXU=U  kKa=—a. 
THEOREM 42. 
tC:a-b=2Cb-C-aCb anda: 
THEOREM 43. The set C is fundamental, 
(0,1) =U, and (r,n) X (p,v) = (r+ pn, nv). 


From Theorem 43 it follows that the product a X b is not in general 
commutative. Moreover the operation aX b does not in general have an 
inverse since when an element has once been rejected in the selection process, 
it can never be returned by further selection. 

The following definitions enable us to study the systems of Wald, v. Mises, 
Reichenbach, and the author: 


DEFINITION 15. Jf B,,B2,--+,Bn is a set of sub-fields of B, then 
these fields are independent provided p(x, = p(2%1) p(X2) 
for every set of elements 2,,2%2,- + +,2%n such that x; is an element of B, 


DEFINITION 16. If B’ is a sub-field of B and a is an element of B, then 
B’ Ca is the field defined as follows: 


(a) If x is an element of B’, then x Ca is an element of B’ Ca. 


(b) Jf y is an element of B’ Ca, then there exists an element x of B’ 
such that x Ca=y. 


DEFINITION 17. A sub-field B’ of B is a collective field with respect to 
a fundamental set S provided p(xC y-Ca) =p(xC y) for every element 
a of S and every pair of elements x and y of B’. 


DeFINITION 18. Let B” be a collective field with respect to a funda- 
mental set S and let the correspondence B’ — B” be atomic. Then since A is 
an w-series, the correspondence defined by Definition 12 prescribes a sequential 
order for a sub-set K of A’. The sequence K thus defined is called a collective 
with respect to the fundamental set 8. (See Wald I.) 


DEFINITION 19. If the fundamental set S (of Definition 18) satisfies 
the Regellosigkettsprincip of v. Mises, then K 1s called a collective. (See 
v. Mises I.) 

DEFINITION 20. A sub-field B’ of B is said to be admissible if 

(a) plaC (r,n)] = p(a) for every element x of B’ and for every par 
of integers r and n such that OS r<n. 


I 
a 
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(b) The fields B’C (0,n), B’C (1,n),---,B’C (n—I1,n) are 
independent for every positive integer n. 

DEFINITION 21. A sequence K defined in terms of an admissible field 
B” in a manner similar to Definition 18, is called a normale Folge (Reichen- 
bach I) or admissible sequence (Copeland IT). 

DEFINITION 22. If the field B” satisfies condition Definition 20 (a) but 
not Definition 20 (b), then the corresponding sequence K is said to possess 
after-effect (Reichenbach 1). 


DEFINITION 23. A field B’ is called an admissible collective field with 
respect to a fundamental set S provided 
(a) It satisfies Definition 20. 


(b) Any operation of the form B’C x-Ca (where x and a are respec- 
tively elements of B’ and 8) transforms the admissible collective field B’ into 
a new admissible collective field B’ Cax-Ca. 


With the aid of the above definitions the systems of Kolmogoroff, Wald, 
v. Mises, Reichenbach, and the author can all be formulated on the base of our 
postulates. The system of Dorge could also be so expressed but a considerable 


number of additional definitions would be required. 


UNIVERSITY OF MICHIGAN. 
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ON THE NUMBER OF REPRESENTATIONS OF THE SQUARE OF 
AN INTEGER AS THE SUM OF THREE SQUARES.* 


By C. D. Oxps. 


1. Introduction. Let N,(n) denote the number of representations of 
an integer n as the sum of r squares. Two representations 


are distinct unless simultaneously 2 = yy, v= 1,2,:-+,7. Notice that in a 
given representation the signs of the roots as well as their arrangement are 
relevant. A zero square is considered positive. 
In 1884, T. J. Stieltjes [7%] + proved by means of elliptic functions that 
N;(p**) = 6p", where p is a prime of the form 8k-+1. Later, in 1907, 
A. Hurwitz [2] stated without giving the proof, that if? . 


8 
(1.1) == 274m == m= P?Q?=1(mod8), 
y=1 


where 9:1, J2,° °°, Qs are distinct primes of the form 4% + 3, and where P is 
the product of primes of the form 4k + 1, then 


(1. 2) N,(n?) = 6P [I [qv, av], 
where 
av] = qv” + 2(q—1)/(qv—1). 


This result was also stated by Stieltjes [7]. G. Pall [5] gave an analytic 
proof of (1.2) in 1930, and an arithmetical proof [6] using quaternions in 
1940. 

The author [4] has given an arithmetical derivation of (1.2) using a 
method similar to that used by Hurwitz [3] in his proof of the analogous 
formula for N;(n?). In this paper the same end is accomplished by using a 
method which may be regarded as a generalization of that used by Stieltjes 
[7] in his discussion of the special case V;(p**). 


2. Three lemmas. The work which follows is based upon an identity 
which has been proved arithmetically by J. V. Uspensky [8]. We call it 


* Received March 4, 1941; this is the third part of a paper presented to the 
American Mathematical Society April 6, 1940, under the title “On the number of 
representations of the square of an integer as the sum of an odd number of squares.” 
The author wishes to thank Professor J. V. Uspensky for help in preparing this paper. 

1The numbers in brackets refer to the bibliography. 

2The meaning of n, m, P, and Q given in (1.1) will hold throughout the paper. 
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LeMMA 2.1. Let F(z, y,z) be an arbitrary function defined for integral 
values of the arguments, and satisfying the following conditions of parity: 
F(—2,y,2) =—F(2,y,2), F(x,—y,—z) =F (2,y,2z), F(0,y,2z) =0. 
Then for any positive integer m =1 (mod 8), 

(2.1) F(A’ + 2A,h, A’ — 2A) = F(t + 2d, i— 28,1 + 2d — 2%) + T 
(a) (b) 
where 
T={>[F(s, 3,7) —F(j,s,7)]}; m = §?,s > 0. 
(c) 


The first two sums extend respectively over integral solutions of the equations 
(a) m=h?-+ 8Ad’, A’ odd; A, A’ both positive; 
(b) + Sodd; d,8 both positive; 
and the third summation runs through values 
(c) =1,3,5,---,s—2, m=s?, s>0. 
In T the symbolic notation {A} is equal to A itself when m ts a perfect square: 
m= s*, s > 0, and is zero otherwise. 


From the equation (2.1), by placing suitable restrictions upon the highly 
arbitrary function F(z, y,z), many interesting relations may be derived. We 
are interested in the one which follows. In (2.1) take 


P(x, y, 2) (—1) y), 
where ¢(z,y) is an arbitrary function satisfying the conditions 
$(x,y) = $(—2,y) = $(4,—y). 
The result of this substitution is 


(2. 2) (— 1) + 2d, — 28) 
= (— 8) — $(s, 7) ]}- 


(ec) 
If we now set 
o(z,y) = cos (ry/4), 
we get in place of (2. 2) 


2D (—1) 


(b) {(s—1) 1) — 2(— 1) (— 
(c) 


which is easily reduced to ® 


Lemma 2.2. 43 (2/i)p(m— 7?) =G(m), m—i??= 0, 
(d) 


® The symbol (P/Q) in Lemma 2.2 is the Jacobi symbol. 
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where (d) indicates that i runs over all positive odd integers rendering 
m— it = 0, and where 


G(m) = (—2/s)s if m=s?, s>0; G(m) =0 otherwise. 
The function p(n) is defined as follows: 


p(n) = p(0) = 1/4, 


the summation ranging over all odd integers 8 satisfying the equation n = dé. 


For convenience set 


=2Sp(m—##), ¥(Q) (2/i)p(m—*), 
(d) 


(d) 


=2 p(m—w), = 4d*(2/i)p(m—?). 
(d) 


(d) 
In the last two expressions the asterisk indicates that the summations extend 
over only those values of 7 which are relatively prime to Q. Interesting rela- 
tions exist between the functions just defined. Let d = (1,Q) be the greatest 
common divisor of and Q, and write i = de and R = Q/d, then (e, R) = 1. 
As d runs through the divisors of Q, R runs through the same set of divisors, 


but in a different order; hence we can write, recalling that m = P?Q?, 


d\Q e=1,3,5,... 
= > [2 p(P?d? — ?)] ¢(d), 
d\Q (d) 


where in order to obtain the last line we have to use the fact that p(a*b) 
= p(b) provided a is only divisible by primes of the form 44 + 3. We have 


found that 


d\Q 
whence, by inversion, 


2.3) $(Q) = p(d)®(Q/d). 


d\Q 
Here »(d) is the well-known Mobius function. The analogous relationship 
between ¥(Q) and ¥(Q) is obtained in exactly the same way. It is 


(2/Q)¥(Q) => (2/d)y(d). 


This leads, by inversion, to 


(2. 4) ¥(Q) => (2/d)¥(Q/d). 
The next step is to evaluate ¢(Q) +¥(Q) in two ways, first by using 
their defining equations, and second by combining (2.3) with (2.4). Equat- 


ing the two expressions thus obtained, and reducing, we get 


(2. 5) +< p(d) (2 d)¥(Q/d) = 4 d*p(m —i7), 


dlQ 


_ 
=> ¢(d), 
= 
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where we take e—-++1 or —1 according as the sum on the right runs 
through values of +==+1(mod8) or i==+3(mod8). To transform 
(2.5) further we have two cases to consider according as PQ =1, 3 (mod 8) 
or PQ=5, 7 (mod 8). When PQ =1 or 3 (mod 8) and i==+ 3 (mod 8), 
or when PQ = 5,7 (mod 8) and 1= + 1 (mod 8), then in either case one of 
the numbers PQ —i or PQ +1 will be of the form 8k + 6 and hence will 
contain an odd number of prime factors (not necessarily distinct) which are 
of the form 4k + 3. Consequently, in either case, m — 7? will contain primes 
of the form 4k + 3 in odd power, and because of this p(m— 7?) will be zero. 
It is clear, therefore, that we can replace (2.5) by 


2 #(d)®(Q/d) = 3(— 2/PQ) (2/d)¥(Q/d), 
since, we recall, (—2/PQ) =-+ 1 or —1 according as PQ =1,3 (mod 8) 
or PQ =5,7 (mod 8). From this last equation it follows that * 
w(d)®(Q/d) - (P/2) (— 1/d) (Q/d) = (P/2)2(Q), 
whence, by inversion, 
= (P/2) 
But, on the other hand, | 


where 
Thus 


— (P/2) (P/2) TE [1 + 


8 
(P/2) IL + —1)/(q —1)]. 
We have established 
LEMMA 2. 3. = (P/2) II w]. 
y=1 
3. The formula for N,(n*). To begin with, we see that : 
N3;(n?) = N,(274m) = N3(m). 


In order to evaluate N;(m), notice that, by hypothesis, m= 1 (mod 8), and 


consequently among the roots of the equation 


m=? + 7? 


‘From now on the symbols (P/2), (Q/d) and (q,av/d) denote fractions and not 


the Jacobi symbol. 
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one root, say 7, must be odd, while the other two must be even. If S represents 
the number of solutions of this equation in which 7 is odd, then it is an easy 


matter to verify that 


S—=2 N.2(m—7?), (1 = 1, 3, 5,- 


the summation extending over all odd integers rendering the argument non- 


negative. It follows, therefore, that 


N;(m) = 38S =6 N.(m— 7’). 
(a) 


On the other hand it is well known that ° 


N.(m) =4p(m), p(0) = 1/4, 


so that 


N;(n*) = Nz(m) = 243 p(m— 7?) = 126(Q). 
(a) 


Now apply Lemma 2.3. We get 


N;(n?) = = 6P II [qv, av], 
) 


which is the desired result (1. 2 


PURDUE UNIVERSITY, 
LAFAYETTE, INDIANA. 
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ON 1-BOUNDING MONOTONIC TRANSFORMATIONS WHICH 
ARE EQUIVALENT TO HOMEOMORPHISMS.* 


By EpitH R. SCHNECKENBURGER. 


1. Introduction. A monotonic” transformation of a space A on to a 
space B is a transformation such that the inverse of each point b of B is 
connected ; in other words, the 0-dimensional Betti number of the inverse of b 
is zero. In this paper such a transformation will frequently be called an &,- 
transformation. It could also be called a 0-bounding transformation. 

A 1-bounding transformation, or #,'-transformation, of a space A on toa 
space B is a transformation such that for each point b of B the 1-dimensional 
Betti number of the inverse of 6 is zero. 

An £&,°!-transformation is a transformation which is monotonic and 
1-bounding. In general, an R,°'---"-transformation of a space A on to a 
space B is a transformation such that for every point b of B the i-dimensional 
Betti number, 1 = 0,- - -,n, of the inverse of b is zero. 

In this paper we consider a single-valued, continuous, /,°!-transforniation 
T of a space A on to a space B such that 7 is equivalent to a homeomorphism. 
We shall say that a transformation 7'(A) = B is equivalent to a homeomor- 
phism when B is homeomorphic to A. 

In a paper* by Roberts and Steenrod it is shown that a single-valued, 
continuous, of a compact two-manifold without boun- 
dary, gives a space homeomorphic to VM. This is a generalization of a theorem 
of R. L. Moore which states this result for the case in which M is a two-sphere.* 
These results suggest the consideration of the following questions: 

1. What spaces, other than two-manifolds, satisfy the condition that 
every 7 of the type under consideration is equivalent to a homeomorphism ? 

2. What conditions must be placed upon a particular T to make it equi- 
valent to a homeomorphism on a given space A? 

In section 3 we state conclusions of the nature described in question 2. 


In considering question 1. we have confined our attention to the study of a 
q 


* Received July 29, 1940; revised March 4, 1941. 
1 Presented to the American Mathematical Society, September 7, 1939. 
2 Defined as a monotone transformation by C. B. Morrey, American Journal of 


Mathematics, vol. 57 (1935), p. 18. 
3 J. H. Roberts and N. E. Steenrod, Annals of Mathematics, vol. 39 (1938), Theorem 


1, p. 854. 


*R. L. Moore, Transactions of the American Mathematical Society, vol. 27 (1925), 


Theorem 25, p. 247. 
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space A which is a Peanian continuum imbedded in a two-manifold M. Our 
results are: 

(a) If A is cyclic, A is a simple closed curve or the two-manifold M. 

(b) If A is acyclic, A is an arc. (This is true for any acyclic Peanian 
continuum. ) 

(c) If A is neither cyclic nor acyclic and certain restrictions are put 
upon the cut points of A, then A is a finite or countably infinite set of simple 


closed curves tangent to each other at a single point p. 


We have concluded that a complete characterization of A, if possible, 


would be too lengthy and tedious to be interesting. 

The author wishes to thank Professor W. L. Ayres of the University of 
Michigan who suggested the investigation of this problem and gave many 
helpful criticisms and suggestions during the preparation of this paper. 

2. Notation. In this paper, the following notation is used: H = homeo- 
morphism ; 7’ = single-valued, continuous, #,°!-transformation ; Y = Peanian 
continuum contained in a compact metric space; == compact two-manifold 
without boundary: A = imbedded in MV; B=T(X) where B is contained 
in a metric space, and it is assumed that B is never a single point. 

We shall also have occasion to use these symbols: R‘(S) =mod 2, 


i-dimensional Betti number of a set S; <ry> = ry — x — y where zy is an arc 


with endpoints x and y; ry> =arc ry minus end point y; <rzy = are ry minus 


end point z. 

A knowledge of the cyclic element theory, as developed in a paper by 
Kuratowski and Whyburn,* will be presupposed. 

3. Conditions that a particular transformation be equivalent to a 
homeomorphism. A vertex y of a finite graph XY will be called non-essential 
if it is the end point of exactly two 1-cells xy and yz and x=2z; otherwise, y 
is an essential vertex. 

Lemma 1. The 1-cells of any finite graph X may be so re-defined that 
every verlex is essential. 

THEOREM 1. If X is a finite connected graph, a necessary and sufficient 
condition that B=T (NX) be homeomorphic to X is that T(u) #~T(y) for 


any two essential vertices u and y of order 


Corotitary. Jf X is a simple closed curve, B =T(X) is also a simple 
closed curve.® 

°C, Kuratowski and G. T. Whyburn, Fundamenta Mathematicae, vol. 16 (1930), 
pp. 305-331. 

°G, T. Whyburn, Bulletin of the American Mathematical Society, vol. 42 (1936), 
p. 66. 
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LemMaA 2. If Ho 1s an order preserving homeomorphism between a closed 
subset S of an arc a—<a,d2 and a subset Ho(S) of an arc B=b,b. and 
a, +a, C 8,b,+ 6, C then there exists a homeomorphism H between 
aand B which agrees with Hy on 8. 


THEOREM 2. Jf X is a tree, S is the set of all end points and branch 
points of X and T 1s a single-valued, continuous, R,°-transformation such that 
T(u) AT (y) where u and y are any two points of 8, then B=T(X) is a 
tree which is homeomorphic to X. 


The proofs of the lemmas and theorems, stated above, are not included 
in this paper since the method of proof in each case is quite obvious. Lemma 1 
is used in the proof of Theorem 1 and Lemma 2 in the proof of Theorem 2. 


4. Conditions that every transformation be equivalent to a homeo- 
morphism. 

LemMaA 3. If A tsa proper subset of M which is cyclicly connected but 
is not a simple closed curve, there is an acyclic continuum in A which 
separates A. 

Let z be any point of A which is on the boundary of some complementary 
domain of A in M. Choose an open 2-cell neighborhood F of z in M such that: 
(1) A-(M—E) $0, and (2) A-(M—E£) where r+ yCH. 
Since A is not a simple closed curve, it is always possible to select an F 
satisfying condition (2). Denote by C the simple closed curve which is the 
boundary of # and by N the component of A — A-C which contains z. Then 
N is a continuous curve which is contained in FE + C. 

It follows from a theorem of W. L. Ayres® that a point q of C-N is 
separated from z in N by a finite number n of arcs® 2; of N, where n is the 
number of components of N —2—gq which have z and q as limit points. 


Let Nz be the component of N — >a; which contains g. The set of com- 
i=1 


ponents NV, for all points of C - N is a collection of open sets covering the closed 
and compact set C-N. By the Heine-Borel theorem a finite number of these 
components cover C-N. Denote these components by Ni. 


For each Nj; there is a finite number of arcs a;; such that } a;; = Wi 


j 
separates x from C’- Nj in N. If, for some 0 and the set 


7R. L. Moore, Foundations of Point Set Theory, American Mathematical Society 
Colloquium Publications, vol. 13, theorem 57, p. 232. 
8 W. L. Ayres, Proceedings of the National Academy of Sciences, vol. 14 (1928), 


theorem 2, p. 205. 
® An are may be a single point. 
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aj; + %x == V is not acyclic, we replace V by a subcontinuum V’ which is 
acyclic. The set V’ is obtained by removing from each simple closed curve of 
V the interior points of a subarc which contains no branch points. There is 
such an arc in each simple closed curve of V because the number of arcs nx 
is finite and hence there can be at most a denumerable number of branch 
points of V on any simple closed curve contained in V. It is evident that V’ 
is connected, closed, and acyclic. 

We repeat the above process for every pair of arcs of W = > W; whose 

i 


sum contains a simple closed curve. Thus, we obtain a finite set of acyclic 
continua Bi: Bj =0, where is either an arc ax or 
an acyclic continuum contained in the sum of two or more of these arcs. 


Then kR-Q=R-O=0, R connected, reR and 


C-NCQ. Let 27,0, be an are of R+ Sf; such that 2,b,5 CR, br € Bi. 
w=1 


In general, denote by 1 = 2,---,s, an are such that <2ibi> CF 
| jal 
ai a point of and bj a point of By. Let A, = (Bi + 7ibi). The 
j=l i=1 
set A, is closed because it consists of a finite anil of acyclic continua. 
Since each z;b; has one and only one point in 3 (8; + xjb;), A, is connected 
j= 


and acyclic. 
s 
There are two cases to be considered. Let us suppose first that R + ¥ Bi 
i=1 
¢ A;. Then, N A, S; + S, = . S. 0, R A, R S;, 
C-NCS8,.. Since any are joining R—A,:-R to A: (M 
a point of C’- N, the set A, is an acyclic continuum separating A. 


EF) must contain 


In the second case, A, = R+ 5 Bi. Let z be a point of order 2 of R, 


2,222 any subare of RF containing no branch point of A,, and A’, and A”, the 
components of 4, —2z,22.. Since A is cyclicly connected, there is an arc p 
which joins A’, and A”; and does not contain z. This are p contains a subarc p; 
having only its endpoints in A’, and A”;. These endpoints must be contained 


in > Bi, hence, (2:22) = 0. Let A, = + Then A— A> 

— 2:22 ~ 0 because it follows from condition (2) that 4 — A-F cannot be a 
subarc of p; Hence, the acyclic continuum A, separates < 2,22 > from points 
of A— A,— 2,2. Thus, in either case, there is an acyclic continuum in A 
which separates A. 

THEOREM 3. If A is cyclicly connected and B == T(A) is homeomorphic 
to A for every T, either A=M or A ts a simple closed curve. 


8 
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The fact that A can be the two-manifold M follows from a theorem of 
Roberts and Steenrod.* Hence, we need consider only the case in which A isa 
proper subset of MV. 

We suppose that A is not a simple closed curve. By Lemma 3 there is an 
acyclic continuum A, in A which separates A. Define 7'(A) as follows: 7'(A,) 
is a single point, 7 is a homeomorphism on A —A,. Then 7'(A;) is a cut 
point of B and B is not homeomorphic to A because A is cyclicly connected. 

Hence, if there exists any set A which is a proper subset of M and satisfies 
the hypothesis of the theorem, A must be a simple closed curve. The existence 
of A follows from the Corollary to Theorem 1. 

THEOREM 4. If X is acyclic and B=T(X) is homeomorphic to X for 
every T, then X is an are. 


Let «, be any are of XY joining two end points of X, ¥ r; the set of branch 
i 


points of X which are contained in 2,, and S; the sum of all components Dj; 
of XY —r; such that Djj-«,—90. Define T(X) as follows: T is a homeomor- 
phism on T7(S;) =7T(r;). Then T is a single-valued, continuous, 
transformation. Further, since 7(X) =T7(2,) =B and T is a homeomor- 
phism on @,, B is an arc. Then B cannot be homeomorphic to X unless X is 


also an arc. Thus 8S; = 0 for every 7. Since every transform of an arc is an 


are when T is monotonic,® B is homeomorphic to X for every 7’ when X is an | 


are. Hence, if X is acyclic, it is an are. 


THEeorEM 5. If B=T(X) is homeomorphic to X for every T, then the | 


closed extension of every component of X —C;, contains at least one Ci, 


where C; is a true cyclic element of X. 


Let x; be any cut point of X which is contained in some C;; and let Dj be Ff 


. . 
any component of X — C; which has z; as limit point. Denote by D; the sum § 
of all the components Dj, such that Dj, contains no C;. We shall prove that D; | 


is vacuous. Define a transformation 7(X) = B as follows: T (Dj + = 
T(x) =z where z is any point of X¥ — > Dj. 
j 

It can be shown that 7 is a single-valued, continuous, ?,°'-transformation. 
Obviously, 7 is single-valued. It is also evident that 7 is continuous over any 
set of points contained in one set D; and over Y —¥ Dj. Hence, to prove 

that 7 is continuous it is necessary to consider only sequences of points {dj}, 


where d; is a point of Dj. Let d be the sequential limit point of such a§ 


sequence d;,. Then d is contained in Y — } D; because each set D; is open 
j 
and can contain no limit point of 1 — D;. Therefore, 7'(d) =d. Since for 


every « > 0 there is a 8 >0 such that S(d,8) is contained in a connected 
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subset of S(d,«) and since the cut point z;, separates d and dj;,, it follows 
that all but a finite number of 2;, are contained in S(d,e); that is, d is 
a sequential limit point of {2;,}. Hence, T(d) =d is a limit point of 
{T = {2j,} = {T(d;,)}. This proves that T is continuous. 

The continuum ); + 2; contains only cyclic elements, namely, cut points 
and end points of \, and is therefore a continuous curve.’° From this fact 
and the fact that D; + 2; is defined so that it contains no simple closed curve, 
it follows that R'(D; + 2;) =0. Thus, T satisfies all the required conditions. 

Since T is the identity transformation on each C;, T(Ci) is a true cyclic 
element of B. Furthermore, Bj, any true cyclic element of B, is contained in 
T(C;) for some j.11. Hence, Bj =T(Cj). Let b, be any cut point of B 
which is contained in some B;. The closed extension of every component of 
B—b, contains at least one 7(C;) = B;, because T is homeomorphic on 
X—>D> Dj. Thus, B is not homeomorphic to VY unless Dj, of Y for every 7 

j 


and h satisfies the condition that Dj, contains at least one Cj. 


Lemma 4. Jf S is a closed subset of X and each component S; of S is 
contained in a continuum L; of X where =0 and Li: Lj = 9, 
then S is contained in a continuum W of X such that R’(W) =0. 

Let L=>L; and define T(X) =B as follows: T(Li) is a single 


t 
point, 7 is a homeomorphism on Y — L. We shall see that this transformation 


satisfies all the conditions required of T. It is evident from the definition that 
T is single-valued. Also, T is an R,°'-transformation because 7T-*(b) for 
every point b of B is a single point or some L;. Lj is a continuum, and 
h'(L;) =0. We prove that T is continuous by showing that the collection of 
all the sets 7’-*(6) is an upper semi-continuous collection of continua filling YX. 
Since every set 7-'(b) which is in Z is a component £; of LZ and each Lj; is 
compact, it follows from a theorem of R. L. Moore ?* that the sets T-*(b) which 
are in J form an upper semi-continuous collection of continua filling L. 
Furthermore, if {2;} = {7-1(bi)} is a sequence of points of X —L which 
converges to x, a point of 7-1(b), then every infinite subsequence of {7-*(bi)} 
converges to x. Hence, all the sets 7-1(b) form an upper semi-continuous 
collection of continua filling Y. We conclude that 7 is continuous, for Kura- 
towski** has shown that an upper semi-continuous collection of closed sets 


10G,. T. Whyburn, American Journal of Mathematics, vol. 50 (1928), theorem 10, 
p. 177. 

1G. T. Whyburn, American Journal of Mathematics, vol. 56 (1934), theorem (3.1), 
p. 298. 

12 See footnote 7, Theorem 20, p. 340. 

180, Kuratowski, Fundamenta Mathematicae, vol. 11 (1928), theorems 2, 3, pp. 
172-173. 
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filling a compact metric space is equivalent to a single-valued, continuous 
transformation defined on the space. 

It is readily seen that 7(Z) is a Moore-Kline subset of B.1* The set 
T(L) is closed and compact because T is continuous. Since 7 is also a 
F,°-transformation, connectedness is an invariant of 7-115 and, hence, each 
component of 7'(L) is a single point. It follows from a theorem of Zippin 
that 7 (LZ) is contained in a tree B, of B. 

Since 7 is continuous and monotonic, 7-1(B,) is a continuum. It has 
been shown by Vietoris *’ that the one-dimensional Betti number of a compact 
closed set in a metric space is an invariant of T. Therefore, R'[7-*(B,)]=—0 
because #1(B,) =0. Since T(L) C B,, LC T-*(B,). Hence, C T-*(B,) 
and 7*(B,) is the set W described in the statement of the theorem. 


THEOREM 6. If B=T(A) is homeomorphic to A for every T and every 
component of P-C; is contained in a continuum Li; of Cy such that 
ii (Li;) = 0, where P is the set of all cut points A and C; is any true cyclic 
element of A, then A= Ci, Ci is a simple closed curve, and C;,- Cj =p, 

i 
a single point, for i= j. 

It follows from Lemma 3 that the continua Lj; of C; are contained in a 

continuum J; of Cy such that R'(L;) =0. We shall prove first that the 


components S; of } L; also satisfy the hypothesis of Lemma 3. Let S, be any 
i 


k 
component of Then 8,— > L’n where Co. If Si: = 
i 


n n=1 


k 
where. is finite, S,;=8,. Also, = is a Peanian continuum. If 
n=1 


S,= > Ln, then VN, = > C’» is a connected subset of A which contains only 

n=1 n=1 
cyclic elements of A. Hence, by a theorem of Whyburn,’® 4, is a continuous 


curve and each point of V,— N, is a cyclic element of A, namely, an end 

point or a cut point of A. Since N, is a continuous curve in both cases, 

R1(N,) = > B(C’n).° Hence, the basis of the 1-cycles of N, may be con- 
n 


sidered a set of 1-cycles each of which is contained in some C’». Since 8S; C ™,, 
it follows that 8, C N, and that any 1-cycle of 8, is a 1-cycle of N;. There- 


141, Zippin, Transactions of American Mathematical Society, vol. 34 (1932), pp. 
705-721. 

15 See footnote 11, theorem (1.3), p. 295. 

16 See footnote 14, theorem 5.1, p. 709. 

17 J,. Vietoris, Mathematische Annalen, vol. 97 (1927), theorem 8b, p. 470. 

18 See footnote 10, theorem 11, p. 177. 

1° See footnote 11, theorem 2.5, p. 138. 
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fore, the basis of 1-cycles of 8, can be chosen so that it is a set of 1-cycles, each 
of which is contained in some (’, and hence in some L’n. It follows that 
R'(S,) = L’n) =0 because = 0 for every n. 
n 
Furthermore, any component S; of > L; satisfies the same conditions as 


4 


one of the sets S, just considered. To see this we need to consider & L; — > Li. 
i i 


Let x be any point of this set. Then z is a limit point of a subsequence {Li,} 
of © L; and hence is contained in §; = > Lj,. Therefore, we conclude that 

i 
> L: => where is a continuum, (S;) = 0, and = 0, 1547. 
i j 
Thus, the sets S; satisfy the hypothesis of Lemma 3 and are contained in a 
continuum S of A such that = 0. 

We show now that C;:C; =p, a single point, for ij. Define T(A) 
as follows: 7(S) is a single point, 7 is a homeomorphism on A—S. If 8 
does not cut Cj, 7'(C;) is a true cyclic element of B. If S cuts Ci, let Di; 
be a component of C; —C;,-S. It will be demonstrated that T (Di; + Ci: 8) 
= Bj; is a true cyclic element of B. Since T is continuous, By; is a continuum. 
Also, Bij; can fail to be locally connected only at the point T(S) and con- 
sequently is locally connected.*° It remains to be seen that Bi; is cyclicly 
connected. 

Let x and y represent any two points of D;;. Since C; is cyclicly con- 
nected and D;; C Ci, the points x and y lie on some simple closed curve C’ 
of C;. If C’ C Di;, T(C’) is a simple closed curve in By; because T is a 
homeomorphism on Dj;. If 0’ C Dij, there are two cases to be considered. 
The curve C’ may satisfy the condition that one of the arcs zy of C’ is in Di;. 
Then there is an arc waryz of C’ such that <uz> C Di; and u+2C C;:8. 
Since T(w) = 7'(z) and T is a homeomorphism on <uz>, T(uryz) is a simple 
closed curve of Bi;. Hence, in this case T(x) and T(y) are on a simple closed 
curve in B;;. 

In the second case neither are zy of C’ is contained in Dj;. Then there 
are ares uv and wyz of C’ such that <urv> and <wyz> are in Dj; and 4, v, 
w,zare points of S. There is an are $ in Dj; joining z and y because Dj; is a 
component of C; —C;-S. In the order z to y let s be the last point of uv 
and ¢ the first point of wyz on 8 There are four cases depending on the 
location of s and ¢. Suppose that s is a point of zv and ¢ is a point of wy. 
Then the arc « = urs + st + tyz, where st C 8 and uas + tys C C’, satisfies 
the conditions that «+ y Ca, <a> C Dij, andu+vCS8. Hence, isa 
simple closed curve of Bi; for the same reasons as those given in the preceding 


20 See footnote 7, theorem 8, p. 95. 
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case for T(uryz). The other cases corresponding to other positions of s and ¢ 
on wxv and wyz can be discussed similarly. Therefore, we conclude that Bj; 
is cyclicly connected. 

Since Di; was any component of C; — C;-S, it follows that T(C;) is a 
set of true cyclic elements of B having only the point 7(S) in common. 
Furthermore, Bj, any true cyclic element of B, is contained in T(C;) for 
some j. Hence, Bj: Bj; = T(S) for i~j and B is homeomorphic to A only 
if Cj is a single point p, ij. 

It follows from Theorem 5 that A =» (; for if some C; contains a cut 

i 


point p’ of A, p’~>p, the closed extension of every component of A —(; 
contains a true cyclic element of A. Hence, there are in A true cyclic elements 
C; and Cx, such that Cj;-C; =p and Cy-Ci =p’. It has just been shown 
that this is impossible since C;- Cj; = p for every ij. 

If some C;, say C,, is not a simple closed curve, let M’ be a manifold 
homeomorphic to M and let H(C,) be the homeomorph of C, in M’. Then 
H(C;) is a cyclic connected Peanian continuum contained in a compact 
2-manifold without boundary and hence Lemma 3 can be applied. Since the 
2-cell H, which was used in the proof of Lemma 3, can be chosen so that £ 
does not contain a certain point ps2, there is an acyclic cut set A, of 
H(C;,) which does not contain H(p). Define T(A) thus: T[H-'(A,)] is a 
single point, 7 is a homeomorphism on A—A,. Then B contains two cut 
points, T7(p) and T[H-*(A,)], and hence is not homeomorphic to A. We 
conclude that each C; is a simple closed curve. 

The Peanian continuum Cj, Ci: Cj = p for i~j, Ci a simple 

closed curve for every 1, satisfies the hypothesis that B is homeomorphic to A 
for every T. Since T is continuous and monotonic, T-'(b) for every point b 
of B isa continuum. Moreover, 7~'(b) - Ci, for any 1, is a continuum because 
the intersection of any continuum and a true cyclic element of a continuous 
curve is a continuum. Consequently, 7 is monotonic on each simple closed 
curve C; and 7'(C;) is a simple closed curve by the Corollary to Theorem 1. 
Also, (Ci) -T(C;) =T(p),i-~j, because T is a Ro°-transformation. Hence, 
B is homeomorphic to A. 

Corottary. If A contains only a finite number of true cyclic elements 


Ci, i=1,---,n, and B=T(A) is homeomorphic to A for every T, then 


n 
A=DCi, Ci: Cj = p, ij, Ci is a simple closed curve. 
i=1 
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ON THE USE OF INDETERMINATES IN THE THEORY OF 
THE ORTHOGONAL AND SYMPLECTIC GROUPS.* 


By HERMANN WEYL. 


1. These lines contain a supplement, concerning the use of indetermi- 
nates, to my book, The Classical Groups, Princeton (1939), which I cite in 
the following as CG. The general infinitesimal operation of the orthogonal 


group, 
(1) dz == Sx, 
is described by a skew-symmetric matrix S = || sq || of which the n(n—1)/2 


elements siz (1 < &) are indeterminate parameters. It is to be expected that 
the first main theorem for vector invariants of the orthogonal group will hold 
if invariance is demanded only with respect to the infinitesimal operations (1). 
In this form the theorem cannot be affected by the field in which we operate, 
as long as it is of characteristic zero. In my book I approached the problem 
of vector invariants for all groups discussed in a systematic way by combining 
the formal apparatus of Capelli’s identities with such non-formal reasoning 
as is used to prove the geometric theorem of congruence. The topological fact 
that the proper orthogonal group is a connected manifold shows that for the 
field K of all real numbers the full group may be replaced by the set of its 
infinitesimal elements. Actually I employed an algebraic equivalent for this 
topological argument which, by a somewhat devious procedure, carries the 
results over to “formal” and “infinitesimal” invariants. I now think it 
would have been wiser to settle the question of infinitesimal invariants for the 
orthogonal and the symplectic groups directly by means of another formal 
identity of Capelli’s type which I propose to develop in § 2. 

Besides the question of invariants there is connected with any group Tr 
of linear transformations A = || a, || in a vector space P an ideal 9(T). 
It consists of all polynomials @(A) of the n? variables aj, which vanish for 
each element A of I. Whereas invariance under the whole group is secured 
when it holds for any set of generators of the group, e. g., for the infinitesimal 
elements, it is essential to postulate the vanishing of ® for all elements A of 
the group; for it is not true that the vanishing of @(4A,) and ®(A2) implies 
the vanishing of ®(A,A.). In proving that (ZY —S)/(E+8) is a generic 
zero of the orthogonal ideal, I should therefore have observed that the 


* Received May 16, 1941. 
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S)/(£ +8) with its n(n —1)/2 indeterminates six (i < i) 


constitutes a group, in the sense that the equation 


expression 


2) 
E+S 


defines a skew-symmetric matrix S in terms of two indeterminate skew- 
symmetric matrices S’ and 8”; the parameters of S are rational functions of 
those of S’ and 8”. Not only did I fail to emphasize this point, but the wrong 
formulation of Theorem (5.3. B) betrays that I overlooked it altogether. The 
elements 


E+ 8’ " E+S 
constitute a group in the above sense. Hence the hypothesis imposed upon 
@(A) in that theorem should have required its vanishing for (3) (and not 
merely for (H—S)/(£-+ 8) and J»). The corollary immediately following 
the theorem and Lemma (7.1. B) ought to be corrected accordingly. (§ 3) 
An important link of the investigation is the fact that the set 
II;[(Z—S)/(£+8)], being a set of orthogonal transformations in the 
tensor space Py, is fully reducible. We shall deal with this point without 
making numerical substitutions for the indeterminates sj, and, in the case of 
the symplectic group, without recourse to the “unitarian trick.” ($4) We 
carry the details through for the symplectic group. The modifications required 


for the orthogonal group are obvious. 


2. The codrdinates of a vector x in n = 2v dimensions will be denoted by 


Indices i,k,: run over the whole range 1,---,v, 1’,- whereas 
a, B,-*- run merely over the half range 1,---,v. The skew-product [ay] 


of two vectors x, y is defined by 
[ry] => — = e(tk) x. 

Its matrix || e(ik) || is called 7. An infinitesimal operation dr = Sx of the 
symplectic group has a matrix 

8(28),  s(ap’) || 

S=l | 
for which 
(4) s(a’p’) =—s(Bu); s(aP’) =s(Bo’), 
As its n(n + 1)/2 parameters s; we use 


all s(a@B); s(aB’) for a= Bf; s(¢’B) for 


lg 


le 
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‘ 


We refer to these conditions (4) by speaking of an, S-matrix. Thus we write 
the indeterminate S-matrix as L(s) = 3Ljs;. 

Let f(z,y,° - -) be a polynomial homogeneous of a certain degree with 
respect to the components of each argument vector. Let e be the number of 
argument vectors and r the degree of f with respect to x. In order to express 
invariance of f under infinitesimal symplectic transformations, we introduce 
the matrix ¥ — vé with the elements X (ik) = 2;&, composed of a covariant 
vector z (single column) and a contravariant € (single row), and moreover 
the matrix X’ with the elements 


A? (WB) = + X’ = — 
Incidentally 


= X + 
Setting = 0/0x; turns XY (ik) and X’(ik) into differential operators d,(ik), 
d@,(ik). The infinitesimal invariance of f is expressed by 


(ik) f =, 


the sum extending over the e argument vectors y of f. 


Let us start with the identity 


(2) | 


(éy) [én] | 


(If one prefers, one may write the trace on the right side as } tr (X’Y’), but 
calculations are a little bit easier with the original form.) Put = 0/0z, 


» = 0/dy; the polar D,zf and the operator 


0 0° 
a a OX’ 


make their appearance. Both are infinitesimal symplectic invariants if f is. 


M 


@(2y; plays the same réle for the symplectic group as the Laplacian operator 


=> 


does for the orthogonal group. Handling the operators D and d as if multi- 
plication by 2; and differentiation 0/02; were commutable, one would obtain 


from (5) the equation 
[ry] - Open f — = — tr (dz, Myf). 


What one really gets are the equations 


of 
_| 
n 
ot 
le 
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[ry] Open f — Dey(Dyef) + rf = — tr (de, d’yf) 
if and 
[ra] —r(r + n)f =—tr (dz, 


for yz. Under the assumption that f is an infinitesimal symplectic 
invariant (briefly, an invariant), we thus find 


(6) ~ [zy] ®penf — ~ Doy(Dyef) = +n+1—e)f 


where >, >’ designate summations over all e arguments y including or 
yy 


excluding the fixed argument z. 
We maintain: 


THEOREM I. Lvery invariant f(x, y,---) ts expressible as a polynomial 
in terms of the skew-products [xy],--- of tts arguments z, y,::-. 


Suppose first e=n-+1. The polar Dycf for yz and @;2,;f (whether 
y = 2 or 7) are invariants of lower degree in than f. Hence the equa- 
tion (6) proves the theorem by induction with respect to r for any degree r 
under the hypothesis that it is true for r= 0, i.e., for invariants depending 
only on the e 


1 arguments y,--:. Thus induction with respect to e yields 
the desired result for e= +1. But Capelli’s general identity allows us to 
increase the number of arguments indefinitely once the stage of n arguments 
is reached. (There is an overlapping inasmuch as the increase of e from n to 
n-+-1 may be effected either by that identity or by (6). Up to n+1 
arguments there are no algebraic relations between the skew-products of the 
arguments. ) 
For the orthogonal group (6) is replaced by the formula 


@ouf — Dey (Dyef) + 2—1—e)f. 


It works for e = —1 while Capelli’s general identity goes on from e = n. 
The gap = n—1-— is bridged by Capelli’s special identity which brings 
in the “ bracket factor.” 


3. The relationship 
(7) A= (E—S)(E = 


between finite and infinitesimal symplectic transformations, A and 8, shows 
that (2) defines an S-matrix in terms of two indeterminate S-matrices 9’, 8” 
with the parameters s;’,s;”. The parameters s; of 8 are rational functions 
Wj(s’,s”). Writing (2) in the form 


We 


Ww 


we 
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one finds at once 
(E+ 8)? = (2+ 8’) + 8’8”) + 


or 


E+S=(£+8")(£+ 8’). 
Hence the functions y;(s’, s’’) have the determinant 
A=|2+8'S” | 
as their common denominator, and 


(8) |-|B+8"|- a7. 


We derive the product matrix II;A in the tensor space P; from 
(9) A= S = L(s). 
Evidently it is of the following form 


(10) R(s) = 


where the numerator f(s) is a polynomial of formal degree nf of the para- 
meters s; with matric coefficients Cz. Thus op» runs over the monomials of 


degree = nf of the parameters s;. The same holds for II‘!)(A): 
p 
We maintain 


THEOREM II. The linear closure of (10), t.¢., the set Gy of all linear 
combinalions SrApCp with numerical coefficients rp, is an algebra containing 
the unit matria. 


The unit matrix is the coefficient Cy of the monomial op =1. Equation 


(2) entails 
E S E 9’ E 
| oe } - Hy | — 


or, because of (8), 


Crr = > 


After the substitution of y;(s’,s”) for s; each o, turns into a polynomial 
or(s’,s’”) divided by Hence 
Dod r 


We now apply the following trivial algebraic lemma: 


SSS 781 
= 
) 
) 
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Let ¢=1-+--- be a given polynomial of some variables z,,- - - , 2, 
with the constant term 1. Then the coefficients of an arbitrary polynomial F 
of degree m may be linearly expressed by the coefficients of 67 = G. 

Arranging the terms of F in ascending lexicographic order one obtains 
recursive linear equations for the unknown coefficients a of F, with the 
coefficients b of G as the known right members (only terms of @ of a degree 
not exceeding m enter). Denote by Fy the terms of degree » in any poly- 
nomial F = F, + F,+---. Putting ¢=1—vo and using the power series 


(1 —w)-*, one finds more explicitly 


F Gp Op, * 


(w203 


If ¢ has integral coefficients, then the coefficients of the linear combinations 
expressing the a in terms of the 6 are likewise integers. 

This lemma is of immediate application to the equation (11) in which A 
begins with the constant term 1. The products oy’oq” are monomials no two 
of which are equal, and thus we arrive at equations 

y 

= 

with rational integers y"pg. They prove our statement. The same equations 
hold for the matrices C,‘"), and thus the linear combinations SA,C,‘” form 
an algebra ©‘). The stage for the whole drama is the field x of rational 
numbers (or, as the arithmetician might care to observe, the even narrower 
ring of rational integers). 

4. We goon operating in x. The next step is 

THEOREM III. The algebra ©; in x is fully reducible. 


Suppose that a subspace {a,} of P; is spanned by a number of linearly 
independent vectors 
= * *, yn), (N =n‘). 


The orthogonal subspace {ay} is spanned by a complete set of linearly inde- 


pendent solutions z = a, of the simultaneous equations 


N 
= 0. 


i=1 
(The ranges of the two indices A and yp are disjunct.) In x the two subspaces 
have no vector in common except 0 and hence span the whole space; or the 
matrix U whose columns are the a) and dy is a non-singular square matrix. 
By construction the symmetric matrix U*U decomposes into a (A, A)- and a 
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(u,4)-part whereas the rectangles (A,w) and (y,A) are empty. We now 
assume the first subspace {a,} to be invariant under @;. This circumstance 
is expressed by an equation 

R(s)-U =U- Q(s) 
where Q(s) is reduced in the sense that its (A, »)-part is empty. We propose 
to prove that its (u,A)-part is also empty. 

To that end, multiply by U* to the left: 
U*R(s)U = (U*U)Q(s). 

We then see that the (A, »)-part of U*R(s)U is empty, 
(12) [U*R(s)T] —0, 


and that it is sufficient to establish the same fact for the (m,)-part of the 
same matrix. 
If S is an S-matrix, so is its transpose S*, The following simple linear 


involutorial substitution of the parameters sj, 
(13) s(aB) > s(Ba), >s(af’), 


changes the indeterminate S = L(s) into S*, hence A, (9), into A*, II;A into 
(11;A)*, and since | + 8 | E+ S*|, also R(s) into R*(s) and R*(s) 
back into R(s). Taking the transpose of the equation (12), 


[U*R*(s)U] = 9, 
and then carrying out the substitution (13), one arrives at the desired result 
[U*R(s) U == \), 


(13) induces a simple involutorial permutation among the monomials 


Op, 0p —> Op. Our argument shows that 


SC Op p Op* Op; ( p*- 


Thus the algebra GC; is a set © of matrices C which coincides with the set €* 
of its transposed elements C*. The method employed vields this general 
proposition: For any set © of matrices in a real field k with the property 
€* — € reduction results in full reduction if projection modulo the invariant 
subspace is carried out as projection upon the orthogonal subspace. This is 
in line with older investigations by E. Fischer who studied groups € of linear 
transformations in a real field # enjoying the property €* = € and showed that 
their invariants depending on a variable form have a finite integrity basis.’ 


Once in possession of our three theorems we put the same machinery into 


* Journal fiir reine u. ang, Mathematik, vol. 140 (1911), pp. 48-81. 
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play as employed on pp. 141-142 of CG. We still operate in the field x. An 
f(z, y,°-+) which is invariant under the generic (9) is necessarily an infini- 
tesimal invariant. Apply the hypothesis to t§ instead of S and take merely 
the first power of the parameter ¢ into account. Consequently Theorem | 
shows that the algebra described as %‘” on p. 174 is the commutator algebra 
of the commutator algebra of II‘? [(#—S8)/(£+8)] or of ©‘. Hence as 
©‘? is an algebra containing the unit matrix (Theorem II) and fully re- 
ducible (Theorem III), the algebra 2‘ is not wider than ©” (R. Brauer’s 
principle). In view of the definition of {‘) this fact remains true in any field 
of characteristic zero. Any polynomial (A) of degree f depending on the 
matrix A = || a; || with variable components a,, proceeds from a linear form 
y(A?) of an arbitrary bisymmetric by the substitution A‘? = II‘! (A), 
If (A) is annulled by the substitution (9) we must have y(C,‘”) 0, and 
therefore y(A‘?) vanishes for all matrices A‘! of 4°. Thus results Theorem 
(6.3. B) on p. 174 of CG. In restating it I propose the following terminology 


for ideals of polynomials Elements of a given 
ideal <3, whose degrees are fm respectively, are said to constitute a 


form basis of § if any element ¢ of <j of degree f may be obtained in the form 
d= hy, hndm 


where hi is a polynomial of degree f—f; (which implies, of course, the 
vanishing of hy whenever f—f; <0). By introduction of homogeneous 
codrdinates & gives rise to an ideal § of homogeneous forms $(2o, 21, °° , 21) 
such that $(2o,4,°*:,21) is in & if and only if $(1,21,---,71) is in %. 
The statement “¢,,°°-,¢m constitute a form basis of §” means that the 
corresponding homogeneous forms ¢;,°°-,¢m of degrees f1,°-*, fm constitute 
a basis of &. 

We arrive at the following proposition concerning the symplectic ideal 
(i. e., the ideal (1) of the symplectic group [) which holds in any field k 
of characteristic 0: 


THEOREM IV. 1) Let A=|| ai || be the matrix with n? variable com- 
ponents aix. The components of the two matrices A*JA —I and AIA* —I 
constitute a form basis, the components of either of them a basis, of the 
symplectic ideal. 

2) The symplectic ideal is a prime ideal, and the expression (9) a generic 
zero; 1.€., a polynomial (A) vanishes for all symplectic transformations A 
if it is annulled by the substitution (9). 


INSTITUTE FOR ADVANCED STupy, 
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AN ALGEBRAIC PROOF OF A PROPERTY OF LIE GROUPS.* 


By CLAUDE CHEVALLEY. 


Let % be a Lie group with complex parameters, and let ¥ be its Lie 
algebra. To every element X « ¥ there corresponds an infinitesimal trans- 
formation of the adjoint group, which we shall denote by X; X is a linear 
endomorphism of &, defined by the formula 


XY =[Y,X] for every Ye ¥. 


If we select a base in &, every X may be represented by a matrix. We denote 
by J the smallest integer such that & contains elements XY for which 0 is an 
l-uple characteristic root of X. Such elements X are said to be regular. 

E. Cartan has shown? that every regular element X belongs to a uniquely 
determined nilpotent sub-algebra? 2 of dimension I of &. The algebra N 
is composed of those elements Y e ¥ for which X'Y —0. We propose to call 
such a nilpotent sub-algebra a Cartan algebra of the Lie algebra &. 

Any automorphism of & permutes among themselves the Cartan alge- 
bras of &. This applies, in particular, to the operations of the adjoint group. 


We want to prove the following result: 


TuHeEorREM. Jf Nl, and N, are any two Cartan algebras of the Lie algebra 
¥, there always exists an operation of the adjoint group which carries No 
onto N. 

This theorem is well known in the case where ¥& is semi-simple.* It turns 
out that the proof, when translated in algebraic terminology, may be extended 
so as to include the general case. 

The possibility of doing this is somehow surprising at first sight, since 
the adjoint group itself is not necessarily algebraic. However, the success of 


* Received May 22, 1941. 

1Cf. Cartan, Thése, Paris, 1894, or Zassenhaus, “ Uber Lie’sche Ringe mit Prim- 
zahlcharakteristik,’” Abhandlungen aus dem Mathematischen Seminar der Hansischen 
Universitdét, vol. 13 (1939), pp. 1-100. 

* The nilpoint algebras are also called “ groups of rank 0.” 

®Cf. H. Weyl, “ Theorie der Darstellung kontinuierlicher halb-einfacher gruppen 
durch lineare Transformationen, part III,” Mathematische Zeitschrift, vol. 24 (1925-26), 
pp. 377-395. For a proof of the theorem in the case of compact Lie groups, ef. André 
Weil, “Démonstration topologique d'un théoréme fondamental de Cartan,’ Comptes 
Rendus de l’ Académie des Sciences, vol. 200 (1935), p. 518. Although his proof is based 
on topological considerations, it is in some ways related to the proof which is given here. 
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our method depends on the fact that the adjoint group contains an algebraic 
sub-group which has already sufficiently many elements to transform any given 
Cartan algebra into any other. 

Before entering into the details, we remark that the Cartan algebras play 
in the theory of Lie algebras the same réle that the Sylow sub-groups play in 
the theory of finite groups. Our theorem corresponds to the fact that the 
Sylow sub-groups of a finite group (corresponding to a prime p) are all 


conjugate sub-groups. 


I. A lemma of algebraic geometry. Let = be a field of algebraic 
functions over an algebraically closed field Q. Let us select any finite number 
r of functions é,, é:,---,é- in %; we build the algebraic manifold UV whose 
“generic point” * is (1, Such a manifold will always be called a 
“ model ” of &, even in the case where 0(é,, &,---,&,-) is a proper sub-field 
of S. If we introduce s new functions 7, y2,°-*,7s of %, we can construct a 
new model UY, of whose generic point is (&, ,&3 m1) 92)" 
If (€;7) is any point of V,, the point (€) is clearly a point of UV, which we 
call the projection on V of the point (€,7). 


Lemma 1. Let there be given a finite system MM, of proper algebraic 
sub-manifolds of V,. There exists a finite system % of proper algebraic 
sub-manifolds of UV such that every point of UV which does not belong to In 
is the projection of a point of VU, not belonging to M,. 


We shall first consider the case where s 1, and we shall set 7, —7. 
There exists, by hypothesis, a polynomial P(2,,%2,---,2,-;y) for which 
P(&, &,° +, & 3) 540, but such that P vanishes at every point of 9M. 
Let us consider separately two cases. 


1. The function y is algebraic over O(€,, &,---,&-). Then there exists 
an irreducible polynomial $(2,, 22,°--,2,;y) which actually contains y and 
is such that $(é1, = 0. Moreover, if is any 
polynomial which vanishes on Vi, considered as a poly- 
nomial with coefficients in is divisible by $(€1, 
It follows that there exists a polynomial @(2,,- - -,2z,) for which 


@(£1, €2,° &r) 0 
and such that 


* For the notions of an algebraic manifold and of a generic point, cf. v. d. Waerden, 
Vorlesungen iiber algebraische Geometrie (Springer). 


ic 
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where *,%r3y) is a polynomial. 
The variety VU, is determined by a prime ideal of polynomials in 2g, y; 
we select a base ¥,,W2,- - +, of this ideal, and, for each one of the poly- 


nomials y;, we write an equation of the type (1). Let ©,(r),@.(r),---, On(z) 

be the polynomials which occur in the left-hand sides of these equations. 
The polynomial P(é,---,&-3y) cannot be divisible by $(&,°--,&,y), 

because we would otherwise have = 9. Since ¢ is irreducible, 


there exist polynomials 


such that 
AP+Bo=C, 


We set @,,, = (C, and we denote by ©n,.(€) the coefficient of the highest 
h+2 

power of y in ¢(é;y). Let @(z) be the polynomial [] ©;(2). Then we have 
i=1 

@(é,,°°:,&) #0 so that the equation © =0 determines on V a finite system 

Mm of proper sub-manifolds. Let (é:,°°*,&-) be a point of VY — MM. Since 

@r.2.(€) 40, the equation $(€,y) 0 has at least one solution 7. Since 

@(é) ~0 (1Six<h), the equations (1) show that 7) =0 

(1 ish); hence the point (&,---,&,,7) belongs to Since 

= #0, and (é,7) =0, 

we have P(é;7) 0, and the point (€;7) does not belong to %,: the lemma 

is proved in this case. 

2. Suppose now that 7 is transcendant over 0(&,---,&-). Let 
@(é,,: - -,&-) be the coefficient of the highest power of y in P(é;y). We 
have @(é,,- -,&-) 0; hence the equation determines a finite 
system % of proper sub-manifolds of VU. If (&,:-:-°,&-) is a point of 
VY — M, there exists a number 7 such that P(é,7) 0, and the point 
(é,7) belongs to U,— %M,. Therefore the lemma is entirely proved for 


s=1, 
If s >1, we denote by UVs-is: the model whose generic point is 


we have Ves. = V, and, for is, we find the model already denoted by 
%Y,. Moreover V i. is obtained from VU; by adjoining a single new codrdi- 
nate. Hence we can determine step by step in each Vj; a finite system 2; 
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of proper sub-manifolds such that 9, is the given system and that every 
point of Ui:— IM; is the projection of a point of VUir— Min. It is 
sufficient to take 9 = Ms5,,, which completes the proof of the lemma. 


Remark. This lemma is very closely related to the theory of relation-true 
specializations of v. d. Waerden. 


II. The manifold of the Cartan algebras. We consider a Lie algebra 
¥ over an algebraically closed field ©, and we select a base {Z,, L2,--+ , Ln} of 
over Q. Let us, Un be indeterminates, algebraically independent 
over ©. We construct the field Q(w) obtained by adjunction of the u,’s to Q. 
The linear combinations of the elements of ¥ with coefficients in Q(u) form 
a Lie algebra #,. We denote by LZ, the element SwiLie Lu, which we may 
consider as a generic element of &. As an element of @y, it is obviously 
regular. We denote by L, the matrix which represents LZ, in the adjoint 


representation of #,; hence we have 
Lu] = Ly = is 1=n), 


where the A;;(w) are linear forms whose coefficients are the constants of 
structure of @. The matrix L, has exactly | characteristic roots equal to 0, 
Therefore, we can select / linearly independent elements My,,,- - +, Mu,1 of 
such that (1=iS1); these elements form a base of a 
Cartan algebra Ny of &u, which is the only Cartan algebra containing Ly. 
We may call Tl, a generic Cartan algebra of ¥. The coefficients of L,! 
being homogeneous polynomials of degree J in the parameters wu, we may 
assume that the codrdinates of the elements M,,; are homogeneous functions 
of the w’s. 

We shall now introduce the Pliickerian codrdinates of an /-dimensional 
vector sub-space 9m of Let Mj = (1 constitute a base 
for 9. We construct over & a Grassmann algebra, in which we denote the 
operation of multiplication by the sign 0). We can write 


where » stands for any arrangement {1,,%2,---+,%:} of 1 of the indices 
1,2,---,n, and Va=Li,0,0°-:-OLi, If we furthermore require 
that é, shall be a skew-symmetric function of the indices which enter in ph, 
the coefficients are entirely determined when M,, - Mi are given. 
Moreover, it is well known that they are all multiplied by a same factor ~0 
if we replace the base {M,, Mz,- - -,Mi} by any other base of 9. Hence 
the numbers € may be considered as giving a system of homogeneous 
coordinates of 9: they are called the Pliickerian codrdinates of In. 


rue 
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Let us now form the Pliickerian codrdinates =,(u) of the generic Cartan 
algebra 11. We may consider them as the coérdinates of a generic point of 
an irreducible projective algebraic manifold. This manifold shall be denoted 
by [M1], and we shall call it the manifold of the Cartan algebras. 


LemMMA 2. If No is any Cartan algebra of &, the Pliickerian coérdi- 
nates of Nl, are the codrdinates of a point of [N]. 


In fact, let us select in > a regular element Ly = 3u,°L;. The matrix 
(L,)* has exactly / characteristic roots equal to zero. Therefore, it has a minor 
A, of degree n —/ which is ~0. Let A(w) be the minor of L,' formed of the 
same lines and columns as Ay. We have A(u.) +0, whence A(u) 0. From 
the elementary theory of the resolution of linear equations, it follows that the 
basic elements M,,; may be selected in such a way that 


a) The elements A(u)M,,; have codrdinates which are polynomials in 
the w’s. 

b) There exists a system of J indices {%,,%,- + -+,%} such that the 
coefficient of Li, in Mu is (i.e, LifA—p, 0 ifAAp), (1SaA, 


From the condition a), it follows that we can make the substitution 
u—> U, in the codrdinates of the elements M,,;; we obtain in this way / ele- 
ments My; Since we have (L,)'My; = 0, we have = 0, whence 
M,,i4¢N>. From condition b), it follows that the 1 elements M>,; form a base 


for No. 
Let us now form a system of Pliickerian codrdinates of 1, by means of 
the base {My1,° Let be the codrdinates obtained in this 


way. It is clear that A’(w)=y(w) is a polynomial in the quantities u, and 
that the Z,(wo) are the Pliickerian codrdinates of ‘1, constructed by means 
of the base {Mo.,° -,Mo,1}. Since the quantities obviously satisfy 
any homogeneous algebraic equation which is satisfied by the Eyu(u), they are 
the codrdinates of a point of [7]: Lemma 2 is proved. 

We shall see later that, conversely, the codrdinates of a point of [N] are 
the codrdinates of some Cartan algebra of &, provided we exclude the points 
which lie on a certain number of proper sub-manifolds of [1]. 


Lemma 3. The dimension of [N] is at most n—l. 


Let us introduce new parameters algebraically inde- 
pendent of the w’s. We set 


l n 
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where the V;’s are linear forms in 1, V2,°-+, v; with coefficients in Q(u). The 
element is clearly a regular element of the Lie algebra = Q(u, v) 
hence it belongs to a Cartan sub-algebra Nu» of Lu», whose Pliickerian 
codrdinates are the quantities V2,---,Vn). On the other hand, Lu,» 
is contained in Q(u,v)N,, which is obviously also a Cartan sub-algebra of 
Hence the quantities =,.(V), are proportional. 

The field & of rational functions on [1] is the sub-field of Q(u) com- 
posed of the homogeneous functions of degree 0 of the Zu(u). We claim that 
we can find / of the variables w which are algebraically independent over %. 
In fact, since the My, are linearly independent, we can find / of the forms 
Vx, say Vi,» Vioy***, Ve, Which are linearly independent with respect to the v’s. 
If we had a relation 


where F is a polynomial with coefficients in 3, we would also have 
F (Vin, View’ Vex) = 9, 


since any element of & is left unchanged by the substitution w%,— V;. But 
this is impossible, since the forms V;,,- - -, Vx, being linearly independent 
over Q(w), are also algebraically independent over this field. 

Since Q(u) is of degree of transcendence n over 2,'S cannot be of a 
degree of transcendence > n — ! over Q, which proves Lemma 3. 


III. The proof of the theorem. Let 7, be any Cartan algebra of ¥&, 
If § is any element of No, the characteristic equation of S has, besides the 
i-uple root 0, other roots a(S), *,@n-1(S) (not necessarily 
distinct). These roots are linear functions of S; none of them is equal to 0 
if S is regular. If a(S) is a multiple root of order m for every SeNo, we 
can assign to « a system of m elements Fai,° - -, Ham of & such that the 
following equalities hold for all Se No: 


=«(8)Ea1; 8] = Eas + Bi2(9) Eos, 
[ Bam; S] a(S) Lam + Bm-1,m(S) Ba m-1 + + Biym(S) 


If we do this for every root a, the »—/ elements F,,; together with 1 basic 
elements of To, constitute a base for ¥.° 

It follows that the base {Z,, L.,: - -, Zn} may be chosen in such a way 
that: 


5 Cf. loc. cit.? 
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1) Dn constitute a base of Ny; 
2) LIn-t. is a regular element of Ny; 
3) If 1=i=n—l, each is one of the elements 
4) We have 
[Li, S] = a(S) Li Bii(S) Li 


for 1S ix=n—l, SeNo, where a;(S) is one of the roots «(S). We say 
that Li “ belongs ” to the root a(S). 

We know also that, if Z; belongs to the root a;, the same holds for every 
[Li, 8S], if SeNo; and that, if Li, L; belong to the roots ai,a;, [Li, Ly] 
belongs to the root a(S) + ;(S) if a(S) + a;(S) is a root and is equal 
to zero otherwise. From this fact and from the fact that there are only a finite 
number of roots, it follows at once that L; is a ni'potent matrix. 

Let us introduce new variables 21, 2%2,° *,Zn-1. Since L; is nil- 


potent, the coefficients of the matrix 
foe) 
exp = > (1/k!)r*L* 
k=0 
are polynomials in the a. We set 
o(%1,° =o(x) = (exp 2,L,) (exp (exp 


The matrix o(x) belongs to the adjoint group of & and its coefficients are 

Since o(a) is an operation of the adjoint group, it produces an auto- 
morphism of @, and o(x)MN> is a new Cartan algebra Nz (we assume that 
the 2’s have been adjoined to 2). A base of Nz is composed of the elements 
o(2) (1 Sh let E(x) be the Pliickerian codrdinates of Nz con- 


polynomials in 2, 


structed by means of this base. If we give to the quantities x any values in Q, 
the é.(z) become the coédrdinates of a point of [M1]. It follows that the 
polynomials é.(2) themselves are the codrdinates of a point of [M1] depending 
rationally on the parameters w. 

We shall now prove that the point (€()) is (1 —1)-dimensional. In order 
to prove this, we denote by y; the codrdinate & for which yw is the arrange- 
ment composed of the indices (i,n—1+2,---,n) (LSiSn—I+1). 
It will be sufficient to prove that the functional determinant of the n —1 
functions - (1SisSn—l) with respect to 2%, %2,° +, is ~O 

(x) 

We denote by [a]? the ideal of the polynomials in 2, %2,° * +, %n-1 

generated by the (1Si,j7Sn—l). We have 
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(2%) Dn-t4n = — Li 
— (Dn-i.n) Lj (mod [2]?) 


where 
Bye = 0 if 


The quantities é,(2) are obtained by expanding the external product 
It follows that we have 
(l1Sisn—l). 
Hence, the value for x0 of our functional determinant is equal to 
(—1)""? : it is different from zero because Dn_1,, is regular. 
The manifold [N] is of dimension = n—J1 by Lemma 3, and has a 


point (é.(z)) of dimension n—J/. Hence: 


Lemma 4, The manifold of the Cartan algebras 1s of dimension n—I 
and the point (€.(x)) is a generic point of this manifold. 


We can now prove the following 


Lemma 5. If we remove from [Nl] the points of a certain system Mo 
of algebraic manifolds of dimensions < n—l, the codrdinates of any remain- 
ing point of [N] are the Pliickerian coédrdinates of a Cartan algebra N which 
is the transform of Ny by an operation of the form o(z). 


In fact, let us consider the manifold 9% whose generic point has for 


codrdinates (21, *,@n-13 where é,,(2) is the polynomial 
denoted above by (i. ¢€., wo is the arrangement 


We may consider the éu(z) /é,,(z) as the coordinates of a point of an 
affine algebraic manifold [NM ]*. From Lemma 1, it follows that we can find 
a finite system 9*, of sub-manifolds of [%]* such that every point of 
[N]* — In*, is the projection on [M1 ]* of a point of KH. We denote by Mp 
the set of the points () of [2] which are such that either €,,—0 or the 
point of codrdinates lies in Let be the codrdinates of a 
point of [%]— M,; then we have &,>40, and there exists a system 
42,° * *,@n-1) of elements of 2 such that 


Eu(@) = 
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It follows immediately that the €, are the Pliickerian codrdinates of the 
Cartan algebra o(a)No. 

We can now prove our theorem. Let 7, be any other Cartan algebra of 
¢. The manifold [7] contains also a finite system 9%, of manifolds of 
dimension < n—1 such that every point of [%7]— 9, represents a Cartan 
algebra which may be obtained from 71, by an operation of the adjoint group. 

We know that these are points on [7] which do not belong either to Mp 
or to %N,. Such a point represents a Cartan algebra which may be obtained 
from both 71, and 1, by operations oo,0, of the adjoint group. The operation 
o, 0, transforms into which proves the theorem. 


Remark. The elements of the adjoint group which we used are elements 
of the sub-group generated by those infinitesimal transformations of the 
adjoint group which are nilpotent. It can be seen without much difficulty 
that this group is always an algebraic distinguished sub-group of the ‘adjoint 
group. It coincides with the adjoint group in the case where the latter is 
either semi-simple or nilpotent. In other cases, it may actually be a proper 
sub-group. However, it always contains the commutator-sub-group of the 


adjoint group. 


PRINCETON UNIVERSITY. 


®T am indebted to André Weil for this remark, as well as for several others. 
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ON THE ERGODIC DYNAMICS OF ALMOST PERIODIC 
SYSTEMS.* 


By NorsBert WIENER and AUREL WINTNER.** 


> developments in celestial 


Introduction. The classical “ trigonometric’ 
mechanics,’ that is, the developments to which the whole of the second volume 
of the Méthodes Nouvelles is devoted, are purely formal in nature. It was 
shown by Birkhoff? that the successive stages of the process defining these 
expansions are subject on any finite time range to certain estimates of the 
order of magnitude; estimates which, being exactly of the same type as those 
underlying his theory of non-linear difference equations, can be adapted to 
local proofs of existence of periodic motions. Correspondingly, the considera- 
tions of the astronomers and the investigations of Poincaré and of Birkhoff 
do not attempt to prove that there exist trigonometric expansions, say 

F(t) aye t, (Ax for k 7), 
k 
which are Fourier series, representing for the function F(¢) an anharmonic 
analysis in a sense suggested by Poincaré himself ;* namely, in the sense that 
0 if for every k, 


T 
lim eth (t) dt = 
2T 
—T 
where p» is any real number. 

By a recondite connection of Diophantine intricacy, the problem must 
somehow depend on the celebrated small divisors in celestial mechanics.‘ 
On the other hand, it turned out that, by virtue of a property of uniformly 
almost periodic angular variables, the Diophantine small divisors prove to be 


* Received March 28, 1941. 

** Fellow of the Guggenheim Foundation. 

1 The classical literature of the subject is collected in R. Marcolongo’s bibliography, 
Il problema dei tre corpi (Manuali Hoepli, nos. 403-405), 1919, pp. 61-79. For more 
recent references cf. A. Wintner, The Analytical Foundations of Celestial Mechanics, 
Princeton, 1941, § 523. 

2 For a short presentation, cf. G. D. Birkhoff, Dynamical Systems, New York, 1927, 
Chap. III-Chap. IV, where references are given to the original papers. 

3H. Poincaré, “ Sur la convergence des séries trigonométriques,” Bulletin Astro- 
nomique, vol. 1 (1884), pp. 319-327. 

* Cf. loc. cit.1, § 523—§ 529. 
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harmless, not only in the integrable case of Liouville systems,> but also in 
such cases (by no means trivial, though relatively simple) as Hill’s theory of 
the perigee® or Adams’ theory of the lunar node.” This suggested that the 
treatment of the general problem might be attempted if Bohr’s theory of 
uniformly almost periodic functions, which deals indeed with a situation quite 
degenerate from the dynamical (or, equivalently, topological) point of view, 
were replaced by the only result of real generality established for the solutions 
of dynamical systems; namely, by the ergodic theorem of Birkhoff.® 

In this direction, it was shown, first in the case of Lagrange’s problem 
of mean motions® and then,’° via the Fourier integral theory of auto- 
correlations, for any metrically transitive system, that Birkhoff’s ergodic 
theorem actually leads to the existence of an anharmonic analysis, as defined 
by the above pair of formulae. The case of an arbitrary measure-preserving 
flow can be reduced to the metrically transitive case, if use is made of von 
Neumann’s transfinite decomposition of the flow into its irreducible com- 
ponents.’? In this sense, the existence problem for an anharmonic analysis, 
as formulated above, can be considered as solved in the general case. 

However, this general result does not supply the answer to the more 
specific dynamical question which concerns not merely the existence of an 
anharmonic analysis but also the problem whether the resulting Fourier series 
of the function F(t) does or does not converge to F(t) in the mean 
(— «0 <t< oo). Obviously, the answer to this specific question of com- 
pleteness cannot be in the affirmative in the general case. For instance, if a 
surface of constant negative curvature is of finite connectivity and of finite 
area, then its geodesic flow is, according to Hedlund,"* a mixture, and so it is 
easy to see that the anharmonic analysis cannot satisfy the condition of 


completeness. 


5 Tbid., § 194-§ 198. Ibid., § 520-§ 522. 7 Ibid., § 484-§ 487. 

8G. D. Birkhoff, “ Proof of a recurrence theorem for strongly transitive systems,” 
“ Proof of the ergodic theorem,” Proceedings of the National Academy of Sciences, vol. 
17 (1931), pp. 650-655, 656-660. 

® A, Wintner, “ On an ergodic analysis of the remainder term of mean motions,” 
Tbid., vol. 26 (1940), pp. 126-128. 

10N. Wiener and A. Wintner, “ Harmonic analysis and ergodic theory,” American 
Journal of Mathematics, vol. 63 (1941), pp. 415-426. 

11N. Wiener, The Fourier Integral and Certain of its Applications, Cambridge, 
1933, § 20-—§ 23. 

12 J. v. Neumann, “ Zur Operatorentheorie in der klassischen Mechanik,” Annals of 
Mathematics, vol. 33 (1932), pp. 587-642 and 789-791. 

#®G, A. Hedlund, “ Fuchsian groups and mixtures,” ibid., vol. 40 (1939), pp. 
370-383. 
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The object of the present paper is to delimit the class of those flows on 
which the condition of completeness is satisfied, i. e., on which the anharmonic 
analysis (in the sense defined above) becomes a Fourier analysis (in the usual 


sense of the word). 


1. In what follows, the existence of an average, M:{g(t)}, of a function 
g(t), —2 <t< o, will be meant in the following sense: g(?) is of class 


(LZ) on every bounded f-interval and there exists a finite limit 


B 
(1) M{g(t)} = im mais g(t)dt 


(in other words, 


B 0 
1 { 1 f 
— t) dt 
B. g(t)dt and RB g(t)dt 
0 —B 


tend to a common finite limit as B— «). It will be very important that 
the limit process occurring in the definition of M+{g(t)} is the one given 


under (1), and not the limit process 
(1 bis) B—A>” 


which is compatible with B—>— o or A—o. In fact, while Birkhoff’s 
ergodic theorem * holds (for almost all points P of the phase space) if its 
statement is formulated in terms of the average definition (1), it does not," 
in general, hold (for almost all P) if the average is referred to the unre- 
stricted limit process (1 bis). On the other hand, it is well-known that the 
mean ergodic theorem,’® resulting by integration with respect to P over the 
phase space, is sufficiently rough to hold even if the t-average involved belongs 
to the unrestricted limit process (1 bis). The method of proof in §8 will 


depend precisely on this discrepancy between the two ergodic theorems. 


2. Let f(t),— 0 <t< o, be of class (LZ) on every bounded t-interval. 
For a fixed real number z, the z-th Fourier constant of f is defined by 
(2) a(x) = }, 


provided that this average exists. If a(x) exists, a(—z) need not exist, 


since it is not assumed that f—f. If (2) exists for every x, the function f(t) 


14 Examples to this effect can be constructed from a perusal of the proof of the 
dominated ergodic theorem of N. Wiener, “ The ergodic theorem,” Duke Mathematical 
Journal, vol. 5 (1939), pp. 1-18. 

1° Cf. G. Birkhoff, “ The mean ergodic theorem,” ibid., pp. 19-20. 
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will be said to possess an amplitude function, a(2). The amplitude function 
a(z), —0o <x< o, if it exists, is a rather discontinuous function in the 
relevant cases. 

Let f(t), — 0 <t< o, be of class (LZ?) on every bounded ¢-interval. 
If the average 
(3) o(s) Miff(t + 8)f(t))} 
exists for every real number s, and if c(s) is a continuous function, f(¢) 
is said to possess a correlation function, c(s). 

It is known"! that every function c(s), — 0 <s< #, which is a 
correlation function (for a suitable f) is the Fourier-Stieltjes transform, 


—oo 
of a uon-decreasing bounded function ¢(2), — o, the “ periodo- 


gram ”'° of f(t). If the monotone function ¢ is thought of as normalized, 
for instance by 
(4bis) ¢(—o)—0 and ¢(a—0)—¢(2), 


then, according to the uniqueness theorem of the Fourier-Stieltjes transform, 
the periodogram ¢ is uniquely determined by the function (4) and therefore, 
via (3), by the function f (on the other hand, the function (3) does not, of 
course, determine the function f). It may be mentioned that there exists to 
every non-decreasing bounded function ¢ a function f possessing the given ¢ 
as periodogram."* 

It is well-known’® that the Fourier-Stieltjes transform, (4), of any 


non-decreasing bounded function, ¢, has the amplitude function 
(5) M,{e-'*8c(s)} = o(2 + 0) 0) 


(so that, in particular, 
(5 bis) M,{e-***c(s)} 20 

16 This nomenclature, which we now propose in order to eliminate the confusion 
that both of us had a definition of “spectrum” (ef. loc. cit.°, the fourth and fifth 
footnotes), differs from Schuster’s terminology in the theory of hidden periodicities only 
insofar as Schuster, having been interested only in cases where ¢(#) is a step function, 
has called the jump (20) of ¢ at w (that is, the “ histogram” of @), and not ¢ itself, 
the periodogram of f. 

17 N. Wiener and A, Wintner, “ On singular distributions,” Journal of Mathematics 
and Physics (M.1I.T.), vol. 17 (1938), pp. 233-246, § 5. 

18 This useful elementary lemma, found by P. Lévy in his Calcul des probabilités, 
Paris, 1925, pp. 169-172, has its historical origin in physical optics. 
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for — 0 <a2< o). Clearly, (5) remains unchanged if the function 
occurring in (4) is replaced by its purely discontinuous component [which 
can be = 0]; that is, by the non-decreasing bounded function defined for 
ow by 


(6) + 0) —$(an— 0)), 
where 2;,%2,° * * is any sequence containing all discontinuity points of ¢. 


(x) is called a step function if it is identical with the function (6). 
In this case, and only in this case, the bounded, uniformly continuous func- 
tion (4), which is almost periodic (5°) even if ¢(z) has a (continuous) 
singular and/or absolutely continuous component, is a uniformly almost 


periodic function. 


3. Let f(t), — 0 <t< o, be any function which has a correlation 
function, (3), and is such that the z-th Fourier constant, (2), exists for a 
fixed x. 


Under these assumptions, it is easy to verify that the function of ¢ defined 
(for the fixed value of 7) by 


(7) fl) (t) f(t) —a(a)et 
possesses a correlation function, 
(8) (t + 8) f(t)}, 


which turns out to be the function 


(9) c'")(s) =c(s) —| a(x) 


+ 


and that the z-th Fourier constant of this function of s exists and is 


represented by 
(10) (s) } o(x +- 0) — ¢(x— 0) | a(x) ie 


where ¢ denotes the periodogram of f(t). 
In fact, since (3) exists for every s, substitution of (7) into (8) gives 


= c(s) — a(x) (t) — a(x) Miff(t + s) ett} + | a(x) 
provided that the averages 
Mi {f(t)et#*} and M,{f(t + = Mi{f(t) 


exist. But they exist, and are represented by d(2) and a(x) e*** respectively, 
since (2) is supposed to exist for the given value of z. Accordingly, 


= c(s) —a(x)a(x) et + | a(x) 
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Since this reduces to (9), and since (9) and (5) imply (10), the proof is 
complete. 

Its trivial character notwithstanding, the result of this simple calculation 
will be fundamental in the sequel, for the following reason: (10) can be 
interpreted as a commutation rule, expressing the deviation which results by 
subjecting the function e-twsF (t) f(t +s) of the two variables s,t (where x 1s 
fived) to the two iterated average operators M:M,s, MsM:; the error committed 
by the interchange of the two limit processes M:,Ms having precisely the 
value (10). 


In fact, from (2), where z is fixed, 

| a(x) |? = Me (t)} Mo (s)} == Me (8) }} 
so that, since M,{e-‘**f(s)} = + s)} for every 

| a (ar) |? — Me (F(t) (t + 8)}} + 8)F(t)}). 
On the other hand, substitution of (3) into (5) gives 

+ 0) —$(x—0) = M, {f(t + s)f(t)}} 

= (t + s)f(t)}}. 

Hence, (10) can be written as a commutator, 
(10 bis) — MiM, Je-*#*f (t + 8) f(t). 


In particular 


(111) Ma{ Mi {er#*f (t + 8) f(t) }} — Ma (t + 8) f(t) }} 
holds, for the given value of 2, if and only if 


(11,) M, =0 
for the same z. 


The above results can be restated as follows: 


THEOREM 1. Jf a function f(t), —o <t< o, has a correlation 
function, (3), and if the x-th Fourier constant, (2), of f(t) exists for a fixed 
t, then the periodogram, $, of f(t) is subject, at the given point x, to the 
inequality 


(12) | a(x) |? S + 0) 0), 


where the sign of equality holds if and only if the commutability condition 
(11,) is satisfied for the given z. 


irs 
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In fact, since (7) has a correlation function, (8), it has a periodogram, 
say $'*) = — 0 <y< ~; so that, corresponding to (4), 


(s) eivsd g(a) (y). 
—oo 
On applying the corollary (5 bis) of (5) to this Fourier-Stieltjes transform 
instead of to (4), one obtains the inequality 


M, (s)} = 0 


for <y< o. The particular case of this inequality, when 
combined with the identity (10), implies Theorem 1, since (11.) has been 
seen to be equivalent to (11,). 


4. The proof of Theorem 3 (concerning almost periodic (B*) functions; 
cf. §5 below) will depend (§6) on Theorem 1 alone. On the other hand, 
before Theorem 3 becomes applicable to the ergodic problem, it will (in § 9) 
be necessary to use, besides Theorem 3, a Tauberian counterpart of Theorem 1, 


namely the following result: 


THEOREM 2. If a bounded function f(t),—_« <t< om, has a corre- 
lation function, (3)—(4), then the a-th Fourier constant, (2), of f(t) exists 
and vanishes for all those values x which are continuity points of the 
periodogram, (x), of f(t). 

The Tauberian element is represented by the assumption | f(t) | < const. 
of Theorem 2. 

Loc. cit..°, the proof of Theorem 2 was given for sequences, instead of 
for functions, but it was pointed out there that the proof applies without 
change to the case of functions as well. It is clear also from the proof that 
the assumption | f(t)| < const. of Theorem 2 can be replaced by other, more 
general, Tauberian conditions; for instance, by the restriction f(t) = 0, if 
f(t) is real-valued. However, it is undecided whether or not the assertion of 


Theorem 2 is true for every function possessing a correlation function. 


5. Let the almost periodicity (B*) of a function f(t), —wx <t< ow, 
be meant in the sense that the trigonometric approximability in quadratic 
mean refers to the definition (1) of the average. 

It will be shown that, in terms of the notions considered in § 2 and § 3, 
the almost periodicity (B?) of a function f(t) can be characterized, without 


19 N. Wiener and A. Wintner, loc. cit.1°, Lemma. 


n 
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any explicit reference to the notions of approximation, translation or com- 


pactness, as follows: 


TuHeEorEM 3. A function f(t), —20 <t< ©, is almost periodic (B?) 
if and only if it has 


(I) an amplitude function ; 

(II) a correlation function ; 
(III) a periodogram which is a step function; 
(IV) the commutability property (11,) for every z. 


As an illustration, let f(¢) sin | Then it is easily verified that 
M,{| f(t) |*?} #0, and that the functions (2), (3) exist and reduce to con- 
stants: a(x) =0, c(s) =M;{| f(t)|?}. Since c(s) = const. means, by (4), 
that (x) is a constant multiple of the step function sgn a, it follows that 
(1), (11), (III) are satisfied. But f(t) is not almost periodic (B*), since 
M,{| f(t)|?} #0 and the Parseval relation are at variance with a(z) = 0. 

Accordingly, the conditions (1), (II), (III) together are not sufficient 
for almost periodicity (B*) ; that they are necessary, is quite straightforward 
(cf. the beginning of § 6 below). Thus the emphasis in Theorem 2 is on the 


form of the additional condition, (IV). 

That (IV) becomes, in virtue of (I), (II), (III), equivalent to the 
definition of almost periodicity (B*), is not as surprising as appears at first 
glance. In fact, the validity of (11,) for every x is equivalent to the assump- 
tion that (11.) is an identity in x But (11.) can be thought of as an 
orthogonality relation (for every fixed 2). Then the criterion represented by 
the set of the four conditions (I)—-(IV) corresponds to the definition of the 
almost periodic class (B*) (that is, to the condition of trigonometric approxi- 
mability in quadratic mean) in the same way that the notion of a closed 
sequence of orthogonal functions relates to the equivalent notion of a complete 
sequence of orthogonal functions in the Hilbert space of the functions f(t), 
0=t=1, of class (L7). Of course, the situation is more delicate in the 
present case, since the integrals become replaced by averages and, corre- 
spondingly, the index a cannot be restricted to an enumerable set which is 


independent of /. 


6. In order to prove Theorem 3, suppose first that f(t) is almost 
periodic (B*). Then (2) exists for every z, and vanishes except when 2 
belongs to a set which is enumerable (at most). If 2, %2,- + - is a sequence 
of distinct numbers containing this set, then, by the Parseval relation, 


(13) | |? < 


n 


m 
d, 
)) 
ts 
le 
yf 
it 
it 
re 
if 
of 
3, 
it 
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In these notations, the Fourier expansion (B*) of f(t) is 


f(t) ~ 3 


n 
Furthermore, if s is arbitrarily fixed, the translated function, f(t +s), of ¢ 
is almost periodic (B*), and 


f(t +s)~2a 


Hence, by the polarized form of the Parseval relation, 


Milf (t +s) f(t)} | 
n 
This function of s is, by (12), continuous (and, as a matter of fact, uniformly 
almost periodic), and appears in the Fourier-Stieltjes form (4) if one puts 


(14) = |a(an)|?; 


it being assured by (13) that (14) defines a non-decreasing bounded function 
for — 0 <2< o. Furthermore, (14) can be written in the form (6), 


where 


(15) +0) —$(a—0) =| 


so that ¢(z) is purely discontinuous. 

This completes the proof of the necessity of the conditions (I), (II), 
(III), if f(¢) is almost periodic (B?). 

Next, suppose that a given f(t), which need not be almost periodic (B*), 
satisfies the three conditions (I), (II), (III). The two conditions (I), (II) 
together can be expressed by saying that the assumptions of Theorem 1 are 
satisfied not for a fixed x but for every x; while condition (III) means that 
there exists a sequence of distinct numbers 2, 22,--- by means of which the 
non-decreasing bounded function ¢(7), — implicitly defined by 
(3), (4) and (4 bis), can be represented in the form (6). 

Under these assumptions, define, for every positive integer N, a function 
Iv(t), <t< o, by placing 


(16) fu(t) = f(t) — a(an)et 


Since (7) has a correlation function (8) which reduces to (9), it follows from 


the orthogonality relation 


(17) Mz {ettte-ivt} 


by induction from N —1 to N, that (16) has a correlation function 


- 
| 
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(18) = Me{fu(t + s)fy(t)} 
which reduces to 
(19) cx(s) = 0(s) — 3 | 


Since the assumptions of Theorem 1 are satisfied for every x, the inequality 
(12) holds for — 0 <2 < so that 


(20) | |? S + 0) —$(an —0) 
for every n. This implies (13), since 


(21) + 0) — — 0)) < 0, 


$(z) being a non-decreasing bounded function for — 0 <a2< o. Further- 
more, since %p + Xm for n~m, it is clear from (20) and (21) that the 


function 


(22) oy(r) = 
+0) — | |?) + + 0) —$(an—0)) 


is non-decreasing and bounded for o. Since ¢(z) is supposed 
to be the sum (6), one sees from (22), (19) and (4) that 


fo @) 


(23) cy(s) = (x) ; 


so that, since (18) is the correlation function of (16), the function (22) is 
the periodogram of (16). In particular 


(24) M:{| |?} = &), 


as is seen by placing s = 0 in (18) and (23). 

All of this was deduced under the assumption that f(t) is a function 
which satisfies (I), (II), (IIL). In order to complete the proof of Theorem 3, 
it remains to be shown that, if (I), (II), (III) are satisfied, f(t) is or is not 
almost periodic (B*) according as it does or does not satisfy (IV). 

By definition, f(¢) is almost periodic (B?) if and only if the functions 
(16) satisfy the requirement 


3 
n 
= 
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|?} > 0, (N-> 
According to (24), this can be written in the form 
0) —dv(— ~) > 0,7 (N— 
This condition is reduced by (20), (21) and (22) to the equivalent requirement 


+ 0) — —0) — | 0. 


According to (20), this requirement is satisfied if and only if (15) holds for 
every n. Since it is not assumed that the Fourier constant a(a,) be distinct 
from 0 for n =1,2,---, one can adjoin every given real number z to the 
n is equivalent to the condition that the function (10) of 2 should vanish 
identically. But this is precisely the condition (IV), since (11,) is equivalent 
to (11.). 

This completes the proof of Theorem 3. 


sequence 2,,%2,°**. In this sense, the requirement that (15) holds for every 


7. The results of §3-$6, which refer to a single function f(t), will 
now be applied to the set of almost all functions in Birkhoff’s ergodic theorem. 
To this end, it will first be necessary to collect those straightforward conse- 
quences of this theorem which involve the notions discussed in § 2. 

Let a space S of points P carry a Lebesgue measure for which S is of 
finite measure, and let T', —20 <t< om, be a cyclic group of transforma- 
tions which map S on itself, are measure-preserving, and such that the func- 
tions T'P of (t, P) satisfies the usual measurability condition. Then Birkhoff’s 
ergodic theorem states that, for every function f(P) of class (Z) on S, the 
average M;{f(T'P)} exists for almost all P, and represents a function f*(P) 
of class (L) on 8. 

Let # be a fixed rotation of a circle C on which the are length is assigned 
as Lebesgue measure. Consider on the product space S X C the product flow 
T* X R and the corresponding product measure, and apply Birkhoff’s ergodic 
theorem to this product model. It is then clear from Fubini’s theorem that, 
if x is any fixed real number, the x-th Fourier, constant, 


(25) ap! (x) = 


of f(t) =f(T7'P) exists, for every given f(P) of class (L), for almost all P. 


THE ERGODIC DYNAMICS OF ALMOST PERIODIC SYSTEMS. 805 


The proof of this well-known trivial consequence of Birkhoff’s ergodic theorem 
has been given here only because its analogue corresponding to the mean 
ergodic theorem will also be needed. Let f(P) be of class (L*?) on 8. It has 
been pointed out by Wiener *° that application of Birkhoff’s ergodic theorem 
to a product model, when combined, as before, with Fubini’s theorem, assures 


the existence of the correlation function 


(26) cpl (s) = Mi{f f(T*P)} 
of f(t) =f(T'P) for almost all P. The corresponding fact supplied by the 
mean ergodic theorem (in the mean of the function space (ZL) = (L*) on 8) 
is that 
B 
(27) lim | epf(s) — f f(Tt*P)f(TtP) dt | dpS =0 
A 
holds for — 0 <s< o (it is understood that dpS denotes the volume 
element on S). 
Let 
(28) or!’ = gr! (x), 


denote the periodogram of f(t) =f(T'P), —« <t< o, (for almost all P) ; 


so that, corresponding to (4 
> > 


(29) cpl (s) = f ed, 
and, by (5) 


(30) M,{e-***cp!(s)} = dp! (a + 0) — dp!(x— 0) for every z. 


It turns out that, if 2 is any fired real number, then the averages (25) 
and (26), which exist for almost all P, satisfy the relation 


(31) = | ap! (x) |? 
for almost all P. It is precisely (31) that will make applicable Theorems 1 


and 3. Corresponding to this central rdéle of (31), the proof of (31) is 
somewhat lengthy; it will occupy the whole of § 8. 


20 Cf. loc. cit.2°, the eighth footnote. 
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8. Let f(P) be of class (Z*) on S. According to the mean ergodic 
theorem, there exists on S a function f*(P) of class (L*) such that, in the 
sense of (1 bis), 

lim f | f*(P) — raf f(T*P)dt |? dpS =0 


B—A->00 


Hence, if x is any real number, it follows, by a repetition of the product argu- 
ment which led from the Birkhoff ergodic theorem to (25), that there exists 
for the given z a function f*2(P) of class (Z*) on S such that 


B 
lim | f*2(P (TP) dt |? dpS = 
B—A B—Av 


A 


Since this means convergence in the mean (L?) of S, it implies the existence 
of a subsequence which is convergent almost everywhere on S. Thus there 
exist two sequences, say A;,A>,- - - and B,, B.,- - +, such that 


and, for almost all P, 
Ba 


as n>, 


1 
B An 
A n 


where z is fixed. Since (25) exists for almost all P, it follows that 
f*2(P) ap! (a). 


On substituting this into the definition of f*,(P), one sees that 


B 


lim fi | ap! — = f dt |? dpS = 0. 
B—A +x B—A 


S A 
The complex conjugate of this (real) relation is 


D 
lim | ap! (x) eivsf ds dpS 0, 


S 


| 


B- 
D- 
B 
D. 


lic 
he 
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where s, C, D are written in place of t, A, B. Since the quadratic mean on S 
has the properties of a distance (Schwarz, Minkowski), the last two relations 


imply that 
D 


B 
§ A Cc 


where the limit sign refers to a double limit, (A,B) and (C,D) being 
independent of one another. 

Since the time ranges, AS¢t=B and C=sZD, are chosen inde- 
pendently, and since the length D—C remains unchanged if the range 
C=s=D is replaced by C+tS=s=D-+t, where ¢ is arbitrary, the last 


relation can be written in the form 


B D+t 
f | ap! (x) — f (7 tP)dt f ds dpS = 0. 
D—C->0co S A C+t 
But 
D+t D 
) ds f f(Tt*P) ds; 
Cst 
so that, by Fubini’s theorem, 
B D+t D B 
f ( dt f eizsf (TP) ds eit8ds f dt. 
A Ct C A 


Hence, the complex conjugate of the preceding limit relation can be written 


in the form 


D B 
lim f || ap? (ax) |? — ew f ‘(T*+sP) f(T'P) dt|ds | dpS = 0. 
D—C>x § Cc A 
Since this relation expresses convergence in the mean of (L) = (L’) 


on S, it implies the existence of a suitable subsequence which is convergent 
almost everywhere on S. Thus there exist, for the fixed value of x, two pairs 
of sequences of numbers, say 

A,, By; Bs, * and C2, ° D,, De,° 
such that 


as ow and Cn ~, Dn > as M> 


u- 
ts 
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and, for almost all P, 
Din Bn 
cf f(T*P)f(TtP) dt|ds = | apf (x) | 


m—>0o Cm An 


Since the limit on the left is a double limit, it can be written as an 
iterated limit; so that 


Da. 
lin. lim f(T'P) dtds = | ap! (a) |* 


for almost all P. 
Since f(P) is of class (Z*) on S, there exists, by Birkhoff’s ergodic 
theorem, a finite limit 
lim | f(T'P) |? dt = Mi {| f(T'*P) |?} 


B00 


for almost all P. Hence, it is easily seen from the integrability properties of 
f and T'P, that, if two numbers, say C and D(>(C), are arbitrarily fixed, 
and if the point P is fixed in the complement of a set of measure zero, then 


B 
SJ at 
A 


of s remains for C= s = D under a fixed bound *! (depending on C, D, and 
P),as A>— 0, Bo. Thus it is clear from the Schwarz inequality and 


the function 


from the assumption 


Brow, (n— 


that, if m and P are fixed, then, unless P belongs to a set of measure zero, 
the functions 


8 


dt 


of s, where n = 1,2,---, are uniformly bounded on the interval Cn Ss S Dn. 
Furthermore, since the average (26), defined by (1), exists for — 0 <s< m 


unless P belongs to a set of measure zero, the expression in the last formula 


*1 For the details of the rather elementary proof, cf. N. Wiener, loc. cit.*, p. 159. 


li 
m- 
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line tends for Cm Ss S Dm and for almost all P to the limit cpf(s),as n— ©. 
It follows, tierefore, from Lebesgue’s theorem on term-by-term integration, that 


Dm Bn Dm 


lim 4 f(T*P)f(T'P) dtds = 

Cm Aa Cu 
for almost all P and for every m. 


Accordingly, the relation containing the iterated limit reduces to 


Dim 


1 
lim (s)ds = | ap! (a) |? 
M—>X m~~vVme 

Cm 


for almost all P. But the average on the left of (30) exists whenever the 
correlation function (29) exists; so that, since the latter exists, by §7, for 


almost all P, 


Dim 


D 
lim ————_—— e-'t8cpi(s)ds = lim f = Ms } 


Cm Dox 
holds, in view of lim Cyn lim Dm = ~, for almost all P. 


The last two formula lines show that the proof of (31) is now complete ; 
it being understood that a zero set of points P is excluded in (31) for every 


fixed 2. 


9. It is now easy to prove a theorem which in its simplest case may be 


formulated as follows: 


If f(P) is a bounded, measurable function on a space S of finite measure, 
and if the flow T* is metrically transitive on S, then the function f(T*P), 
—xr7 <t< ~, 1s or ts not almost periodic (B*) for almost all P according 
as its periodogram (which, by §7%, exists for almost all P) is or is not a step 


function for almost all P. 


In fact, the metrical transitivity of T* means that M;{g(TtP)} is con- 
stant almost everywhere on S for every function g(P) of class (Z). But then 
it is clear from the proof of the existence of the correlation functions (26), 
that (26) is independent of P for almost all P. It follows, therefore, from 
(29) and from the uniqueness theorem of the Fourier-Stieltjes transform, 


9 


= 
= 
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that the monotone function (28) of x is independent of P for almost all P. 
Hence, the preceding italicized statement is implied by the following theorem : 


THEOREM 4, Let f(P) be a bounded, measurable function on a space § 
of finite measure, and let T'P, —«w < to, be a measure-preserving, (t, P)- 
measurable transformation group of S into itself; so that, by § 7, the function 
f(TtP), —w<t< oo, has a periodogram ¢pl(x), —x om, for 
almost all P. Suppose that there exists an enumerable set of points x which 
contains the set Xp! for almost all P, where Xp! denotes, for almost all P, the 
sequence of those points x at which the monotone function gp! ts discon- 
tinuous. Then the function f(T'P), —x2x <t< o, is almost periodic (B*) 
for almost all P if and only if the function dp! (x) of x is a step function of « 
for almost all P. 


In the proof of Theorem 4, the assumption that f(P) is a bounded func- 
tion will be used only via Theorem 2. Hence, if the answer to the question 
formulated at the end of § 4 were in the affirmative, Theorem 4 would follow 
for every function f(P) of class (L*). Incidentally, it will follow from 
Theorem 6 below, that every function of class (Z*) can be admitted in the 
particular case italicized, before Theorem 4. 

In order to prove Theorem 4, let f(P) be a fixed, bounded, measurable 
function on S. According to § 7, there exists, for every real number 2, a. zero 
set, say Z,, such that the z-th Fourier constant, ap/(x), of f(T'P) exists for 
every point P of S—Z,. Since f(T'P) has, by § 7, a correlation function, cp’, 
and therefore a periodogram, ¢p’, for almost all P, the zero set Zz can so be 
chosen that not only the 2-th Fourier constant but also the periodogram of 
f(T'*P) exists for every point P of S—Z,. Finally, since (31) holds for 
almost all P for every fixed x, the zero set Z, can so be chosen that (31) is 


satisfied by every point P of S—Z,. 

According to the hypothesis of Theorem 4, there exists a zero set, say Z*, 
and a sequence, say 2, 2%,°-*, such that, if P is any point of S —Z*, then 
the monotone function — <a< is defined (i.e, f(7T'P) has 
a correlation function) and has no discontinuity points x distinct from every 
Xn. It follows, therefore, from Theorem 2, that, if P is any point of S — Z*, 
the z-th Fourier constant, ap/(x), of f(T*P) exists and vanishes for every x 
distinct from every 2». Thus, if P' is any point of S— Z*, then 


gp! (x + 0) — gr! (x—0) =0 = | ap! (zx) |? 


for every x distinct from every x,. Hence, it is clear from (30), that, if P 
Z*, then (31) is satisfied for every x distinct from every <p. 


is any point of S 
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This, when compared with the above definition of the zero sets Zz, 
—o <2£< o, implies that, if the point P is in none of the zero sets 


then the z-th Fourier constant of f(T'P) exists for every x, the correlation 
function of f(7'*P) exists, and (31) is satisfied for every 2. In view of (30) 
and of the equivalence of the two relations (11,), (112), one can express this 
by saying that conditions (I), (II), (IV) of Theorem 3 are satisfied by 
f(t) =f(T'P) for every point P not contained in the zero set 


Since Theorem 3 implies that, in case (1), (II), (IV) are satisfied, (III) 
is necessary and sufficient for the almost periodicity (B*) of f(t), the proof 
of Theorem 4 is complete. 

It is clear from this proof that the assertion of Theorem 4 supplying a 
necessary condition for the almost periodicity (B?) of almost all function 
f(T'P) certainly holds for arbitrary, and not only for bounded, functions 
f(P) of class (L*), and is, in addition, independent of the hypothesis of an 
enumerable set °°. 


10. The assumptions of Theorem 4 are so specific that if its assertion is 
true for certain bounded, measurable functions f(P) on S and for a given 
flow, its assertion need not be true for every bounded, measurable function 
f(P) on S and for the same flow. Thus there arises the question as to a 
criterion which corresponds to that supplied by Theorem 4 but involves only 
the flow on S, without involving a particular function f(P) on S. In order 
to deduce from Theorem 4 such a criterion, i¢ will from now on be assumed 
that there exists on S a complete orthogonal system. This assumption is made 
because it will be necessary to establish the connection between the periodo- 
grams gp’, belonging to individual points P of S, on the one hand, and the 
considerations of Carleman ** and Koopman,” concerning integrals of f(7'P) 
over S, on the other hand. 


*2'T. Carleman, “ Application de la théorie des équations intégrales linéaires aux 
équations différentielles de la dynamique,” Arkiv fér Mat., Astr. och Fys., vol. 22 
(1931), no. 7. 

23 B. O. Koopman, “ Hamiltonian systems and linear transformations in Hilbert 
space,” Proceedings of the National Academy of Sciences, vol. 17 (1931), pp. 315-318. 


n 
h 

) 
n 
n 
0 
ye 
yf 
r 
n 
y 
v 


812 NORBERT WIENER AND AUREL WINTNER. 


The latter considerations center around the remark that, if the transition 
from a function fp =f(P) of class (Z*) on S to the function fr=f(T'P), 
where ¢ is fixed, is thought of as a transformation, f; = U'f,, of the Hilbert 
space (L?) of S into itself, then the measure-preserving property of 7 on S 
can be expressed by saying that U‘ is unitary.** Correspondingly, the 
group property of JT‘ and the (P,¢)-measurability of T‘P mean that U', 
— oo <t< o, isa cyclic group such that, in the sense of strong convergence, 
Us—>»U' whenever s—>t. 

Let f.g denote the scalar product, 


(32) f.g— ars, 


of two vectors of the Hilbert space (L*) on S, and let 
(33) f.®(2)f, —oc rc 0, 
be the spectral form of the group of unitary forms, 
(34) f.U*f, ow. 


Thus, if x is fixed, (33) is a bounded Hermitian form in f (boundedness 
being meant in the sense of Hilbert) ; while if f is fixed, (33) is a non-negative, 
non-decreasing, bounded function of x and can be chosen as normalized, 


corresponding to (4 bis), by 
f.©(— f=0 and f.6(¢—0)f =—f.(x)f; so that f.6(0)f—f.f. 


Then Ut, — «0 <t< o, determines, according to Stone,”° exactly one 
—«o<2x< , such that (34) is representable by means of (33) in the form 


** Since every unitary operator is bounded, and since every bounded operator in 
Hilbert space is, according to Toeplitz, a bounded matrix, the spectral theory of uni- 
tary matrices (A. Wintner, loc. cit.*°) has not been generalized by replacing it by the 
corresponding theory of unitary operators. In this connection, cf. A. Wintner, “ Dyna- 
mische Systeme und unitiire Matrizen,”’ Mathematische Zeitschrift, vol. 36 (1933), pp. 
630-637. 

25M. H. Stone, “Linear transformations in Hilbert space,” Proceedings of the 
National Academy of Sciences, vol. 16 (1930), pp. 137-139. Previously, the theory of the 
discrete, instead of continuous, unitary cyclic groups in Hilbert space, was developed 
by A. Wintner, “ Zur Theorie der beschriinkten Bilinearformen,”’ Mathematische Zeit- 
schrift, vol. 30 (1929), pp. 228-282. It was shown by S. Bochner (“ Spektralzerlegung 
linearer Scharen unitiirer Operatoren,” Sitzungsberichte der Preussischen Akademie der 
Wissenschaften, 1933, p. 371), that the method applied there in the discrete case can 
easily be transcribed to the continuous case, and this approach appears to be the 
simplest among the known proofs of Stone’s result. 


w 
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(35) f.Utf= f cited, (f.®(xr)f). 
—oo 
Hence, ® is uniquely determined by the flow <t< 
Correspondingly, the point spectrum of the flow can be defined to be the 


point spectrum in the sense of Hilbert, that is, the sequence 


of those points 2, at which the monotone function (33) of x is discontinuous 
for at least one f. In particular, the flow will be said to have a pure point 
spectrum if the spectral form, (33), of the flow reduces to 
(37) & +0) —&(a%—0))f, 

where x and f are arbitrary and z, ranges over the point spectrum, XY. 

A connection between these notions, which refer to the whole of the 
flow, and notions involving individual stream lines in the flow can easily be 


established, as follows: 


THEOREM 5. If (33) denotes the spectral form of a measure-preserving, 
(P,t)-measurable flow Tt on a space S of finite measure, then 
(38) pp! (x)dpS = 


S 


for every function f(P) of class (L*7) on S, where dp! denotes, for almost all 
P, the periodogram of f(T'P), —x <t< om. 


CoroLitaRy. A real number, X, ts a discontinuity point of the monotone 
function op! of x for at least one fixed f and for a P-set of positive measure, 
if and only if x is in the point spectrum, X, of the flow; furthermore, the 
flow has a pure point spectrum if and only if gp! is, for every fixed f and for 
almost all P, a step function of x. 


In order to prove Theorem 5, let f(P) be a function of class (ZL?) and 
let s, A, B be real numbers. Then, according to Fubini’s theorem, 


B B 
S A A Ss 
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Hence, on replacing in the inner integral on the right the integration variable 
P by T'P, where ¢ is fixed, one sees from 


B 
f f dt (B—A) dpS 
S Ss 


A 
that 


B 
fs f(Ts'P)f(P)dt f f(T*P)f(P) des. 
S A 64 


It follows, therefore, from (27) that 


J ep! (s)dpS f f(T*P)}(P) dps. 


Ss S 


On applying (29) on the left and (32) on the right, one obtains 


S 


since U*f = f(T*P), by the definition of U*. Hence, on writing ¢ for s, one 


sees, by applying Fubini’s theorem on the left and (35) on the right, that 


00 
f eitrd, ( pp! (x) dpS eitrd, (f. @(x)f) 
ec S 


where 


co <t< o; whence (38) follows from the uniqueness theorem of 


the Fourier-Stieltjes transform. 


11. The flows which are almost periodic in the sense indicated at the 


beginning of § 10 can now be characterized as follows: 


THeEoREM 6. Let T* be a measure-preserving, (P,t)-measurable flow on 
a space S of finite measure, and let Then f(T'*P),—_ <t< ~, 1s, 
for every function f(P) of class (L%) on S and for almost all P, almost 
periodic (B%) if and only if the flow has a pure point spectrum. 


In other words, the flow transforms every Lebesgue integrable function 
on S into a Besicovitch almost periodic function of t for almost all P if and 
only if the flow is Bohr almost periodic. It is understood that by the Bohr 
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almost periodicity of the flow is meant the Bohr almost periodicity of the 
scalar function (35) of t, where f =f(P) is any function of class (L*) on S. 
Then the equivalence of the last italicized statement with Theorem 6 is clear 
from the remark made at the end of § 2. 

In order to prove Theorem 6, associate with every f(P) on S the sequence 
of functions f;(P), f2(P),: on S which are defined by 


( 
(39) fa(P) = 


WR 


It is then clear, either from Birkhoff’s proof of his ergodic theorem or from 


the proof of Wiener’s dominated ergodic theorem, that 
(39 bis) M.{| f(TtP) —fr(T'P) as n> 


holds for almost all P, if f(P) is any function of class (Z4) on S. Hence, if 
f,(T'*P) is almost periodic (B%) for every n and for a fixed P which is not 
in a certain set of measure zero, then f(7‘P) is almost periodic (B%) for the 
same P. But every fn(TtP) is, by (39), a bounded function of ¢ and is, 
therefore, either almost periodic (B%) for every g = 1 or not almost periodic 
(B71) for any g = 1. Hence, Theorem 6 is equivalent to the following state- 
ment: For every measurable, bounded function f(P) on S, the function 
f(T'P) of ¢ is almost periodic (B?) for almost all P if and only if the flow 
has a pure point spectrum. Since the truth of this statement is clear from 
Theorem 4 and from the Corollary of Theorem 5, the proof of Theorem 6 is 
complete. 

In order to attempt a dynamical understanding of the actual content of Theorem 6,. 
use will be made of Koopman’s interpretation °° of the point spectrum in terms of the 
“first integrals” of the flow 7¢; an interpretation which holds also when the point 
spectrum is not pure. In fact, whether the point spectrum, (36), does or does not 
satisfy the condition, (37), of purity, a real number, 2, is in the point spectrum if and 
only if eit is a characteristic number of the unitary form (35) for every ¢, i.e., if and 
only if there exists on S a function F = F'(P) which is not 0 (almost everywhere on S), 
is of class (Z?), and has the property that 
(40) (P) = F(TtP), where —~ <t<.«, 
is an identity (for almost all P). A corresponding statement holds concerning the 
multiplicities of characteristic numbers. But if « = 0, then (40) reduces to 

F(P)=F(TtP), —x<t<., 
which means that F'(P) is a “ first integral” of the flow Tt (provided that a subset of 


*° B. O. Koopman, loc. cit.**, p. 318. 
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S of measure zero is disregarded), unless F(P) is constant on S (almost everywhere). 
If, on the other hand, « ~ 0, then, on assuming that F(P) is regular enough to make 


F*(P) =argF(P), i.e, expiF*(P) =F(P)/|F(P)|; 
a meaningful definition of a function F* of the position P on S (almost everywhere), 
one sees that (40), can be written in the form 
(40 bis) F* (TtP) = at + F*(P)(mod2r), where —~ <t< 
But (40 bis) means that also F*(P) corresponds to a “ first integral,” namely, to one 
belonging to an “ ignorable coérdinate ” leading to an “ angular variable.” This is the 
interpretation of Koopman, referred to before. 

Suppose now that the flow on S has a pure point spectrum. Then, and only in this 
case, the characteristic numbers w and the corresponding characteristic functions F'(P) 
determine all unitary invariants of the flow on the Hilbert space (L*) of S. Accord- 
ingly, the flows of pure point spectrum can be thought of as characterized by the fact 
that their complete system of unitary invariants is expressible in terms of “ first 
integrals” alone. But then Theorem 6 means that the “first integrals” of exactly 
those flows have this completeness property which determine “conditionally periodic” 
paths as “general (or, rather, generic) solutions.” 

This seems to agree with certain statements of physicists, which have never been 
motivated in a mathematically reasonable direction; actually, most of the problems in 
question are not even meaningful without definite topological assumptions in the large. 
Correspondingly, the curious fact expressed by the following theorem is likely to have 
topological implications in the dynamical case, where S is a manifold and 7% is a 
continuous flow on S. 

THEOREM 7. If a measure-preserving, (P,t)-measurable flow T' on a 
space S of finite measure has the property that f(T'P), —«2 <t< o, is, for 
every function f(P) of class (L*) on § and for almost all P, almost periodic 
(B?), then all the Fourier exponents of f(T'P) are contained, for almost all P, 
in the point spectrum, X, of the flow, and so ina single enumerable set which 
is independent of P (and also of f). 

Theorem 6 and the hypothesis of Theorem 7 imply that the flow has a 
pure point spectrum. Hence, Theorem 7 follows from Theorem 2 and from 
the Corollary of Theorem 5. In fact, a real number z is called a Fourier 
exponent of an almost periodic function f(t), —o <t< o, if the z-th 


Fourier constant of f(t) does not vanish. 


12. The results obtained will now be applied to the case of metrical 
transitivity. In order to simplify some of the formulae belonging to this case, 
it will be assumed that meas S 1. This is only a normalization, involving 
no loss of generality, since the case meas S = o was always excluded, while 
the case meas S = 0 is always trivial. Since meas § — 1, the flow Tt on S is 
metrically transitive if and only if 


Mi{f(TtP)} = f(P)dps 
8 


holds for almost all P, whenever f(P) is of class (L) = (Z*) on S. 
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THEOREM 8. If the flow Tt on S, where meas S —1, ts metrically 
transitive, there exists for every function f(P) of class (L) on S§ a subset, Z, 
of S which is of measure zero and such that, unless P 1s in S —Z, 

(i) the a-th Fourier constant, ap!(x), of the function f(T'P), 
—°o<t< o, (which need not be almost periodic (B) in this case) exists 
for every x and determines an “ intensity,” | ap'(x)|, which is independent of 
P (so that only the “ phase,” arg ap'(x), of the a-th “amplitude,” ap'(z), 
can vary with P), where Z 1s independent of x; 


(ii) the point spectrum, X, of the flow (which may or may not be a 
pure point spectrum) contains every Fourier exponent (“frequency”) of 
f(T'P), <t< every real number satisfying ap!(x) ~ 0; 


(iii) in the particular case where the function f on S is of class (L?), 


the correlation and intensity functions of f(T*P), —w <t< om, are 
explicitly given by 
(1) op! (x) 
and 
(II) |apf(x) |? =f.(@(a@+0) —O(e#—0))f, 


where f.®(2)f denotes the spectral form of the flow. 


In order to prove this theorem, it will be sufficient to establish the first 
part of (iii), i.e. (1), for every f of class (Z*), but (i), (ii) and the second 
part of (iii), i.e. (IL), only under the assumption that the given function 
on S, instead of being of class (Z) and of class (L*) respectively, is bounded 
and measurable. In fact, if f is of class (Z) and f, denotes the function (39), 
then an obvious adaptation of (39 bis) shows that, when = is fixed, 

lim ap!(x) =ap!(x) for almost all P; 

so that the statements (i), (ii) follow for the given function f of class (L), 
if they are granted for every fn. If in addition f is of class (LZ), then, by (39), 


lim | 0; 


so that (II) follows for the given f, if it is granted for every fn. 
Next, let f be of class (L*). Then, as shown in the proof of the italicized 
statement at the beginning of §9 (in the paragraph preceding Theorem 4), 
the periodogram ¢p/ (exists and) is independent of P for almost all P. Hence, 
(I) follows from (38), where meas S = 1, for every f of class (L?). 
Accordingly, the proof of Theorem 8 will be complete if one verifies 
(i), (ii) and (II) for every bounded f(P). But (II) implies the whole of (ii) 


818 NORBERT WIENER AND AUREL WINTNER. 


and it also implies the second part of (i), i.e., the statement that only the 
phase of ap/(x) can vary with P. Finally, (1) reduces (II) to the relation 


| ap! (x) = dp! (x + 0) — ppl 0). 


Hence, in order to complete the proof of Theorem (8), it is sufficient to prove 
that there exists for every bounded measurable function, f, a zero set, Z, such 
that ap/(x) exists for — 0 <a2< o and satisfies the last relation, unless P 
is in S—Z. But the existence of such a Z follows from (31) and from 
Theorems 2 and 3 in the same way as in the proof of Theorem 8; in fact, 
(I) implies that the enumerability hypothesis of Theorem 8 is satisfied. 
The, simplest illustration of Theorem 8 is supplied by the “strongly 
mixing” flows, considered by Koopman and von Neumann.” In fact, these 
flows can be characterized as those metrically transitive flows for which the 
point spectrum, X, consists of a single point, z = 0; so that the criterion of 
Theorem 6 for almost periodicity (B) is violated in its full extent. The other 
extreme case, that is, the case of a metrically transitive flow with a pure point 


spectrum, is represented by any Kronecker-Weyl flow on a torus. 


APPENDIX. 


As mentioned at the beginning of § 5, the almost periodicity (B*) of a 


measurable function f(t), —o<|™|@t< o, has thus far been meant in the 
sense that there exists for f(t) a sequence of trigonometric polynomials, say 

such that 

(42) M:{| f(t) —rn(t)|?} exists for n = 1, 2,- - 

and 

(43) M:{| f(t) —rmn(t)|*} as n> @, 

where M;{---} is meant in the following sense: 


B A 
(44) M:{g(t)} = lim mais g(t)dt, i.e, = lim > f g(t)dt. 
— 


Almost periodicity (6°) in this sense is a sharper property than almost 
periodicity (4°) according to the original definition of Besicovitch.** In fact, 
he requires the existence of a sequence (41) of trigonometric polynomials 


*7 B. O. Koopman and J. v. Neumann, “ Dynamical systems of continuous spectra,” 
Proceedings of the National Academy of Sciences, vol. 18 (1932), pp. 255-263. 
2° Cf. A. 8. Besicovitch, Almost Periodic Functions, Cambridge, 1932. 
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satisfying (42) and (43) when M:{---} is defined not by (44) but as a 
principal limit, i. e., by 


A 
(45) Mi{o(t)} lim af g(t) dt. 


The replacement of (44) by (45) is essential, for instance, in the theory of 
the Riemann zeta-function,”* since those theorems on ordinary Dirichlet series 
which center about the mean-value theorem of Schnee do not, in general, hold 
if (44) is required instead of the more inclusive definition, (45), of M:{--- }. 
Needless to say, the proofs of Theorems 1, 2, 3 hold, without any change, if 
(44) is replaced by (45) in (2), (3), (4) and (42)-(48). The resulting 
wordings of Theorems 1, 2, 3 are then, of course, independent of their original 
wordings. On the other hand, Theorem 5 then expresses a weaker statement 
than its original wording. 

The situation with regard to Theorem 5 becomes quite different if (44) 


is replaced not by (45) but by 


> 
> 


/ 
(46) Mi{g(t)} = lim g(t)dt, 
A 


B—A>ow 


where the limit process consists of an indefinite increase of the length of the 
integration domain A =¢= 8B; a process which is compatible with either 
A—o or B>— ~. Let f(t) be called almost periodic (W*) if there exists 
a sequence (41) of trigonometric polynomials satisfying (42) and (43) in 
the sense of (46). It is easy to see that f(t) is almost periodic (W?) if and 
only if it is almost periodic in the sense of Weyl.*° The proofs of Theorems 
1, 2, 3 hold without change if (44) is replaced by (46) in (2), (3), (4) and 
(42)-—(43) and, correspondingly, (B?) in Theorem 3 by (W?). 

On the other hand, Theorems 4, 5, 6 become false if (44) ts replaced by 
(46) and, correspondingly, (B4) by (W4). This is implied by the fact that 
a measure-preserving, (?,¢)-measurable flow 7* on S may or may not be of 
the unrestricted type, that is to say such that M;{f(T'tP)} exists, for every 
function f(P) of class (1) on S and for almost all P, if M¢{---} is meant 
in the sense (46), instead of the restricted sense (44) of Birkhoff’s ergodic 
theorem. 


2° Cf. A. Wintner, “The almost periodic behavior of 1/¢(1 + it),” Duke Mathe- 
matical Journal, vol. 2 (1936), pp. 443-446; “ Riemann’s hypothesis and almost periodic 
behavior,” Universidad Mayor de San Marcos, Lima, vol. 61 (1939), pp. 575-585. 

°° H. Weyl, “ Integralgleichungen und fastperiodische Funktionen,” Mathematische 
Annalen, vol. 97 (1926), pp. 338-356. 
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ADDENDUM.* 


Since this paper was written, it has been possible to decide the alternative 
formulated at the end of §4. The answer to the question turns out to be 
affirmative. In other words, the assertion of Theorem 2 holds without the 
assumption 
(47) | f(t)| < const. 
also: 

THEOREM 2bis. Jf f(t) has a correlation function, the x-th Fourier 
constant of f(t) exists and vanishes for all those values of x which are 
continuity points of the periodogram of f(t). 

In order to simplify the formulae occurring in the proof, let (1), where 


<t< om, be replaced by 


1 
(48) Mi{g(t)} = lim =f g(tyat, 


where 0 =t¢< o. It is clear from the remark which follows (1), that this 
modification is unessential. 

For sake of a further typographical simplification, let it be assumed that 
(49) f(t)=0 for 
This assumption involves no loss of generality, since none of the t-averages M 
occurring in Theorem 2 bis is influenced by a change of the values of f(t) on 
a bounded f-interval. 


In view of (49), the function 
t 


(50) F2(t) (y) dy 


is bounded on every bounded (/-interval. Since the existence of the correlation 
function, (3), implies that 

(51) Mi{| f(t) |?} < 

it follows from the Schwarz inequality that 

(52) | F*(t)| < const. 

for 0 =t< o and for every z. 


The proof of Theorem 2 was based? on a Tauberian fact, according to 
which M;{g(t)} exists and vanishes whenever 


* Received May 2, 1941. 


— | 
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(53) | g(t)| < const. 

and 

sin? ¢ 

(54) f g (ut) dt+>0 as u>o., 
0 


It was shown loc. cit.1° that if f(#) has a correlation function, and if x 
is any continuity point of the periodogram of f(t), then 


00 
(55) lim e-tatf(t) = (), 
€ 
0 


€—> 00 


whether the additional restriction | f(t)! < const. is or is not satisfied. In 
fact, (55) becomes identical with the last formula line of § 5, loc. cit.*°, if one 
replaces the sequence {a,}, considered there, by the present case of a function, 
f(t). The restriction | f(t)| < const. was there used, not in the proof of (55), 


but only in the passage from (55) to 
(56) M;{e-**f(t)} —0; 


this passage having been based on the Tauberian theorem quoted before. 

In order to prove Theorem 2 bis, it will now be shown that (51) and (55) 
imply (56) without the restriction | f(t)! < const. also (it being understood 
that the existence, and not the vanishing, of the mean-value (56) is the 
essential part of the statement). 

First, on writing ¢ for ef in (55), 


OO 


€ 2 


Hence, on placing v =1/e and averaging with respect to v between = 0 
and a large v(— uu), 


i ( if yt) —— it) dv-0 as u>o, 
Us 


and so, by Fubini’s theorem, 


0 0 


ve 

de 

le 

, 
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(in fact, (49) and (51) imply that the last iterated integral, when written 
as a double integral, exists absolutely for every fixed u>0). Since 


ut 
1 1 


0 


is, by (50), identical with F*(ut), it follows that 
oo 
sin’ 
f F2(ut) as u->o. 
0 


The last relation and (52) show that both conditions, (54) and (53), 
of the Tauberian theorem are satisfied by g =F". Hence, Mr{g(t)} exists 
and vanishes for g =F”. This means, by (48) and (50), that 


u t 
t 
t 
0 0 


Since (49) and Fubini’s theorem imply that 


u t u u 
( v) e~tat ) 
(f aw) f (f tO ww) at 


0 0 0 t 
and since 


it follows that 


(57) Lf e*ztF(t) log - dt — 0, [cf. (49) ], 


0 


where ©. 


The assertion of Theorem 2 bis is (56), that is, by (48), 


While it is an obvious Abelian fact that (58) implies (57), it is clear that 
(57) in itself cannot imply (58). However, it will now be shown that (57) 
implies (58) in virtue of (51). This will complete the proof of Theorem 2 bis. 


= 


THE ERGODIC DYNAMICS OF ALMOST PERIODIC SYSTEMS. 823 


On placing g(t) = e-‘**f(t), where z is fixed, one sees from (49) that 
(57) and (58) become 


(59) g(t) log” dt 0 
U « t 
1 
and 
1 
(60) g(t)dt 0 


respectively; while (51) becomes 
(61) Mif| 9(t)|?} < 


Thus all that is to be proved is that (59) implies (60), if (61) is satisfied. 
Actually, (59) implies (60) whenever 


u 


(62) iim + f | g(t) |?dt < © 
u>oo U 
1 
(and even if only 
2 bis | ( 
(62 bis) lim g(t) |?»dt< 


x 
holds for some p> 1). 

In order to prove this, let 3 be a fixed number in the interval 0< 0 <1. 
Then w— © is equivalent to Ju—> ~ ; so that, on writing du for u in (59), 
one sees that 

ou du 
log 


1 

f g(t)dt Du f q(t) lo dt 
1 


as u—> co. Since 


u u 
1 u 1 _ f u 
ny g(t) log di +>, g(t) log dt g(t) log dt 0, 
1 du 1 


by (59), it follows that 


log 
fi g(t) )dit — 9 ( (t) log dt 0; 


so that, since 


: 
bed 
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g(t) log dt 
du 


division by | log gives 


u 


< flo (t)\log dt =| log f | g(t)! a, 
ou 


ou 


ou u 
! 


lim aS g(t) dt << lim | g(t) | dt. 
1 


| 
u co | UU Ue 
du 


But the upper limit on the left is independent of #; so that, on estimating 
the integral on the right by the Schwarz inequality 
u 


‘ 


ou ou 
one obtains the inequality 


lim | g(t)dt |= lim € g(t) it) 
1 


1 


On letting here }->1—0, one sees from the assumption (62), that the 
proof of (60) is complete. 

It is clear from this proof that (62) can be generalized to (62 bis), 
since the inequality of Schwarz can be replaced by that of Holder. 
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THE CONTINUITY OF THE AREA OF HARMONIC SURFACES 
AS A FUNCTION OF THE BOUNDARY REPRESENTATIONS.* 


3y Marston Morse and C. TompxIns. 


1. Introduction. We are concerned with surfaces § in a euclidean 
m-space of coordinates = (x). We restrict ourselves to harmonic 
surfaces 7; = 2;(u,v), that is to surfaces for which the functions 2;(u, v) 
are harmonic functions of (u,v). These functions are defined on a connected 
region B in the (u, v)-plane, or on a Riemann surface spread over the (wu, v)- 
plane. The region B is bounded by a finite number of non-intersecting circles. 

We begin with the case where B is single-sheeted and is bounded by a 
single circle C. For simplicity we take B as the unit disc wu? + v? << 1. Let 
S be a surface with z7;(u,v) harmonic on B and continuous on the closure of B. 
Let (r,@) be ordinary polar codrdinates in the (u,v)-plane. Set 


x; (cos 6, sin?) = pi (6) 1,---,m). 


We assume that the functions p;(@) have bounded variations. We shall 
designate the set pi(@) (t—=1,--+,m) by p. 

Subject to the above restrictions we shall regard p as a variable, more 
particularly as a point in a metric space M. To define the distance pq between 
two points p and g in M, let Vi(p) denote the total variation of p;(@) on the 
range 0=@= 27. Relative to g, Vi(q) is similarly defined. We set (cf. 


footnote 5 in [3]) 
(1.2) pq = = {| Vil(p) — Vi(q)| + max | pi(6) — |}, 
where the maximum is taken over the range 0S 6S 2x. It is clear that pq 


satisfies the usual axioms for a metric space. Let Q(p) be the area of the 
harmonic surface defined by p. Our principal theorem is as follows. 


The area Q(p) varies continuously with p on M. 


In § 2 we shall give an example showing that this theorem is false if the 
distance pq is defined by the formula 


(1.3) pq = max | pi (9) — qi (9) |. 


In further appraising the meaning of this theorem, recall that the 
Dirichlet-Douglas integral (cf. [2]) 


* Received March 15, 1941. 
10 


826 MARSTON MORSE AND C. TOMPKINS. 


D(p) =3 SJ + dudv 


is lower semi-continuous. However D(p) is not in general continuous for p 
on M, or even on subsets of M for which D(p) is finite. An example showing 
this will be given in § 2. 

The functions D(p) an? %(p) are important instruments in the study 
of minimal surfaces. They ai  »..uected by the relation (see [1], p. 312) 


Q(p) = D(p), 


with the equality holding if and only if S' is a minimal surface. When 9 is 
minimal, p is termed a critical point of D(p). For critical points p then 


Q(p) = D(p). 
We accordingly have the following corollary of our theorem. 
The integral D(p) 1s continuous on the subset of critical points p of D(p). 


This corollary is more striking if one recalls the corresponding theorem 
for extremals in ordinary, regular, positive definite problems. Replace p, for 
example, by an arbitrary curve joining two fixed points on a closed regular 
surface with codrdinates which are 4-fold differentiable with respect to the 
surface parameters (u,v). Replace D(p) by the length L(p) of p. The dis- 
tance between two curves p and g may be taken as that defined by Fréchet. 
The analogue of our corollary is the theorem that L(p) is continuous on the 
subset of geodesic segments which join two fixed points of our surface. 

Our theorem has been used in the paper [3] to lighten the conditions 
(cf. [4], [7]) under which minimal surfaces of unstable type are proved to 
exist. We shall make further applications of this theorem in the theory of 
minimal surfaces in the large. 

The theorem is extended to surfaces of general topological types where 
B is bounded by n circular contours (cf. [6]) and is a region on a Riemann 
surface. In this general case the area is shown to depend continuously not 
only on the boundary vectors, but also on the radii and centers of the circles 


bounding B. 

2. Preliminary remarks and examples. Let S be a harmonic surface 
a= 2;(u,v) defined over the region B, r<1, with the boundary repre- 
sentation 2; = pi(@) as in (1.1). We refer to the metric space M of boundary 
representations p with distances pq defined as in (1. 2). 


Example 1. There are many examples which show that D(p) is not in 
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general continuous on M. We shall give an illuminating example of this type, 
illustrating properties of D(p) not ordinarily shown in such examples. 

To that end we shall define a sequence p" (m—1,2,- + -) of repre- 
sentations p in the space (x) = (21,22). We shall make use of parameter 
transformations « = ¢"(@) in which ¢"(6) is continuous and increasing, with 
+ =¢"(6) + 22. In the (a, @)-plane the graph of a—¢"(6) shall 
consist of three successive line segments with ¢"(0) =0. The middle line 
segment shall have its midpoint at (7,7), a length of 1/n, and a positive 
slope k, to be defined. The curve p” shall have the form 


= cos ], = sin (0S05 2z). 


If kn becomes infinite for a fixed n, D(p™) does likewise as is well-known 
(cf. [1], p. 300). We shall take &, so large that D(p") exceeds n, so that 
D(p") becomes infinite with n. Nevertheless $"(@) converges uniformly to 4, 
while p” converges to the representation q, 


(2.1) = cos 0, = sin 6. 


Observe that D(q) is finite. Thus p" converges on M to qg, while D(p") does 
not converge to D(q). 


Example 2. Let M* be the metric space of rectifiable curves p with 
distances defined as in (1.3). We shall give an example showing that Q(p) 
is not continuous on M*. We shall make use of a result of Radd, [5], p. 92. 
We shall consider curves p in the space of codrdinates (2, y,z). Let C be the 
projection of p on the (2, y)-plane. Every point (2, y) not on C has a definite 
integer n(x, y) as its topological index with respect to C. With Radé define 
N(a, y) as | n(x, y)| if (az, y) is not on C, and as 0 otherwise. Then according 
to Radé, 


(2.2) a(p) = N(ay)azay, 


the integral being taken over the whole plane in the sense of Lebesgue. 
We shall give an example in which a circle qg of the form 


(2. 3) x = cos 8, y = sin 6, == 0) 


is a limit of a sequence p” of curves in the space (x, y, z), convergence being 
defined by the metric of M*. To that end we choose a point P on q, and 
replace a segment t, of g with midpoint at P by a curve with the end points 
of t, and with a spiral-shaped projection on the plane z= 0 so chosen that 
for the curve p” replacing g, Q(p") exceeds n. This can be done in accordance 
with (2.2) in such fashion that the length of ¢, is finite, that the diameter 


= 

= 
= 
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of ¢, tends to 0 with n™, and that Q(p”) is finite. It is clear that p” will 
converge to the circle q in the sense of M*, provided the parameter 6 is properly 
assigned to the segments t,. Thus p” converges to g in the sense of M*, while 
Q(p") does not converge to 2(q). 

The lengths of the above curves p” are not bounded. This leads to the 
following question, the answer to which is not known, to the authors. For 
curves p of uniformly bounded length with a metric given by (1.3) is Q(p) 
continuous ? 

We shall need a few facts concerning the spaces M and M*. Recall that 
the metrics of M; and M* are defined respectively by (1.2) and (1.3). Let 
vi(«, p) denote the total variation of p;(@) on the interval For 
p fixed, vi(,p) is continuous in « For « and p both variable, 1;4(a, p) is 
lower semi-continuous, convergence of « being ordinary, and convergence of p 
being convergence on M*. If p converges to g on M, v;(27, p) converges to 
v;(2r,q) in accordance with the definition of pg on M. We shall prove the 


following lemma. 
LemMA 2.1. If p converges to q on M, v4(0,p) converges to (6, q) 
umformly with respect to (0 =O 2x). 


Let p denote the representation pi(— 6) (t—1,---,m). If a and B 
are non-negative and sum to 2z, 


(a, P) -+- Vi (B, Pp) vi (27, P); 


so that 
(2. 4) v4 p) = v4 (2x, p) — vi (8, p). 
Now v;(8,) is lower semi-continuous in its arguments so that — v;(B, p) 


is upper semi-continuous. Since v; (27, p) is continuous in p, it follows from 
(2.4) that p) is upper semi-continuous in p). But vi (a, p) is also 
lower semi-continuous, and hence continuous. 

The uniform convergence affirmed in the lemma follows from the usual 
cluster point argument taking the compactness of the interval 0 = 05 2x 
into account. 


3. The theorem for a single contour. On Set 
xi(r cos 6,r sin 8) =hi(r, 6), (4am 1,---,m), 


and recall that for r < 1, 


1 aod (1 —r?) 
ps(a) 1 — 2r cos (@— a) +7 
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Let Qp(p) denote the area of that part of S on which rp. We suppose 


that 0< p< 1. Recall that 


(3. 2) Q(p) =lm Qp(p). 
p=1 


When p is fixed, Qp(p) is continuous in p for p on M, for the area is given 
by the classical formula 


27 p 

(3.3) (if —1,---, ms <j) 
0 0 i,j 

where 

and the partial derivatives involved vary continuously with r, 6, and p on M. 


To establish the continuity of Q(p) on M, we shall first obtain an 


estimate of 


(3.5) OQ (p) —Qo(p) (ON<a<p<l) 


for o and p near 1. It follows from (3.3) that 


or p 
(3. 6) 2,(p) —Qe(p) | Ai | drdo 
0 o ij 


It will be sufficient for our purposes to obtain an estimate of the integral 
(3. 7) ff | au | dra (O< 
0 


To that end we use (3.1), and set 
(3. 8) 1— 2rcos (@— a) +r? 


Observe that for r < 1, 


According to the theory of Riemann-Stieltjes integrals (cf. [8], p. 589), we 


may integrate by parts and thus obtain 


(3. 9) dpi (a). 


Differentiating under the integral sign in (3.1), we find that 


dh; 1 [(1 + r?) cos (@— a) — 2r] 
or T Jo pi (4) (a, 6, r) 


y 
e 
e 
) 
t 
t 
r 
) 
(4,7 = 1, 1< 7). 
) 
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On integrating by parts with the aid of formula 305 of Pierce’s tables (third 
revised edition), we see that 
sin 


(3. 10) 


again a Riemann-Stieltjes integral. 

We shall make use of (3.9) and (3.10) to obtain a formula for | Aj; |. 
Upon replacing 7 by j in (3.9) and (3.10) when necessary, and at the same 
time replacing the parameter of integration a by B, we find that 


Ai; (1, 
1 2, [sin (@—a@) —sin (@—8)] , 


Let the total variation of p;(«) from 0 to @ be denoted by 14(a«). Recall 
also that 


(3.12) | sin (@—a) —sin =2| sin cos ( 


a—B| 


IIA 


@| sin 


With the aid of (3.12), (3.11) yields the relation 


(3.13) |Ag;(r,6)|S =f. dv;,(a)dv;(B). 


To reduce (3.13) further, we shall set 


and prove two lemmas. 
Lemma 3.1. For 


0 g(%, 0,7) 9(B, 9, r) R(a, 


(3. 14) 


First recall that 


| cos (@— a) + cos (0—B)|=2| cos 


cos 


It follows from the definitions of the terms involved that 


(3. 15) g(a, 6,1) + 9(8, 6, r) = 2R(«, B,r). 


( 

| 
<2| cos | 
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Holding (a, 8,7) fast, let J and J be respectively the sets of values of 6 on 
the interval (0, 27) for which 


(3. 16) g(%,0,r) = R(a, Br), 
and 
(3. 17) 9(%, 9,7) < R(a, 


On J, 9(8,9,r) > R(a, B,r) by virtue of (3.15). 
On I and J respectively, 


1/9(%,9,r) S1/R(a,B,r),  1/9(B, 8,7) <1/R(4, Br). 


Hence the integral in (3.14) is at most 


ut 


= 
and a similar relation follows for the integral over J. Hence the sum (3. 18) 


is at most 2R-', and the proof of the lemma is complete. 
By virtue of formula 67 of Pierce’s tables, we find that for0 << a<p<1 


and for «= 8, 
—| Ne 


We shall use this formula, representing the integral (3.19) as a function 
Z(o,p,%,8). Referring to the left member of (3.19), we see that Z(¢, p, a, B) 
is continuous in its arguments for 0<o<jp <1. We use that branch of the 
arctangent which ranges from —-/2 to 7/2. 


The following lemma is fundamental. 


LemMA 3.2. Corresponding to an arbitrary curve p° of M and a pre- 
scribed positive constant e, there exists a neighborhood N of p° on M anda 
constant 75 <1 so near 1 that for pon Nandrn<o<p<l, 


Let » be an arbitrary positive constant, and let 6 be so small a positive 
constant that whenever | «— | <8, 


(3. 21) | —vi(@)| <9 


| 
| 
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for the variations v;j(a) and v;(@’) of pi°(a). Let N be so small a neighbor- 
hood of p°® on M that (3.21) holds likewise for variations vi(#) and v;(0’) 
of pi(«) when p is on N. That such a neighborhood exists follows from 
Lemma 2. 1. 

We substitute the right member of (3.13) for | Ai; | in (3.20), thereby 
replacing the integral (3.20) by an integral with respect to a, B, r, and 0. 
We invert the order of integration, first integrating with respect to 0, using 
Lemma 3.1. Denoting the integral (3.20) by Ki;(o,p, p), we see that 


(54) 
23 29 p 
S4 f. f ~ drdv;(«)dv;(B). 
0 0 


(a, B, r) 


We next integrate with respect to r, using (3.19). We find thereby that 


The function Z(o,p,%, 8) is periodic in a and B. For each j (j =1,---, m) 
and constant h, the difference 


vi(a +h) 


is likewise periodic in «. The integral on the right of (3.22) may accordingly 


be given the form 
27 
(0,0, 2, (8). 


For each £, the interval 
(3. 23) 


will be divided into the subinterval 
(3. 24) B—8SaeS8+8 (0<8< 


and its complementary set (8). We write the integral (3.22) in the form 


B+5 2r 


We shall estimate the terms in (3.25). Let B be a bound of v;(27) as 
p varies on N. Such a bound exists since (3.21) holds for |«—¢o¢ | <8, 
and pon N. Note that Z is at most 7. Hence 


f vj(B +8) —vj(8—8) ]dv;(B) 


< (2n) B. 
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For « on J(8), we can give a sharper estimate of Z. For a on I(£), 


|a—f|=6. If ro differs sufficiently little from 1 with 0 < ro < 1, and if 
then Z <7 [a on Hence 


4 f Zdv; (a) dv;(B) 4yB?. 
Th 0 I(B) 
us 
(3. 26) wKij(o,p,p) S n| + 4B*}. 


But 7 is arbitrarily small and (3.26) holds for p on N and for 75 <a <p <1, 
provided N is sufficiently small and ry is sufficiently near 1, the choice of NV 
and 7) depending on and p°. 

The lemma follows from (3. 26). 

We have not established the uniform convergence of Qp(p) as p tends to 1, 
but the preceding lemma gives us a result equally effective in proving the 
continuity of Q(p). 


THEOREM 3.1. The area Q(p) of the harmonic surface of disc type 
defined by the boundary curve p varies continuously with p, as p varies on the 
metric space M. 


Recall that m is the number of codrdinates z;. It follows from (3.6) 
and the preceding lemma that for p on N, 


Qp(p) —Qo(p) < me (% p< 1), 


provided WN is a sufficiently small neighborhood of p°® on N and 1, is sufficiently 
near 1. Letting p tend to 1, we infer that 


(3. 27) Q(p) —Qe(p) S me (t% <1; 


For fixed « conditioned as in (3. 27), let N, C N be so small a neighborhood 
of p® that 
| Qe(q) —Qe(p’)|<e (qeM,). 
Then 
(3.28) | 2(q) —2(p’)| S| 2(q) —Qo(q)| 
+ | Qo(q) —Qo(p’)| + | Qo(p?) —Q(p’)| S me + + me, 


Thus the left member of (3. 28) is arbitrarily small if ¢ lies on a sufficiently 
small neighborhood N, of p®. 


The proof of the theorem is complete. 


4. The general case. We shall refer to various metric spaces M,, Mo, 
,M, with points X,,X.,-- +, respectively, and shall consider the 
corresponding product space II, in which a point is determined by the ensemble 
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(Xi, Let (4%, X’2,- -,X’+) be a second point in I. With 
these two points in II we shall associate a distance given by 


+ X24", + + X re 


where X;X’; is the distance in the j-th factor space. If the distances in the 
factor spaces satisfy the usual metric axioms, it is easy to prove that the dis- 
tance associated with the product space satisfies these same axioms. 

We return to the harmonic surface 8. In the general case, the region of 
definition B of the functions z; = z;(u, v) defining 8 is a bounded connected 
region of a multiply-sheeted Riemann surface & spread over the (u, v)-plane. 
This Riemann surface shall possess at most a finite number of sheets and bear 
at most a finite number of branch points. The boundary of B shall consist of 
a finite number of circles C' on & which do not intersect on the Riemann sur- 
face nor pass through branch points. Let (r, 6) be polar codrdinates in the 
(u, v)-plane in a system “ belonging to C,” that is with pole at the center of 
C and with axis parallel to the positive u-axis. On each boundary circle C 
there shall be represented a boundary curve 7; = (0). The functions 2; (u,v) 
defining S$ are harmonic over B and continuous on B. If a is the radius of C, 
and (u,0) gives the center of C, we assume that 


xri(acos 6+ = pi(6), (t—1,---,m), 


where that branch of z;(u,v) is to be used which belongs to the sheet of & 
which bears C. For simplicity we shall suppose from this point on that & is 
a single-sheeted surface bounded by the various circles C. 

Each boundary curve p will be regarded as a point in a metric space M 
with a distance function defined as in (1. 2). The ensemble p of the respective 
boundary curves will be regarded as a point in a product space with its asso- 
ciated metric obtained as above from the metrics of its factors. The (uw, v)- 
codrdinates (ui, 7) of the centers of the boundary circles C together with their 
radii will be called the circle parameters of the boundary circles. The ensemble 
b of these circle parameters will be regarded as a point in a euclidean space 
of the corresponding number of dimensions. We shall have occasion to regard 
the pair (p, 6) as a point in a product space with the corresponding associated 
metric. Similarly, the ensemble (u,v, p, 6) will be regarded as a point in a 
product space with corresponding associated metric. 

The region B and surface § will be more precisely designated by B(b) 
and S(p,b) respectively. The codrdinates x; of the point on S(p,6) will 
then be represented by functions f;(u,v,p,b) (‘=1,:--,m). Let C(b) 
designate an arbitrary boundary circle of B(b), and let 7; = p;(@) be the 


| 
| 
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boundary curve defined over C(b). If (r,6) are polar codrdinates belonging 
to C(b), if (a, 7%) gives the center of C(b), and if a is the radius of C(b), 


fi(a cos + a, asin 6 + p, b) = pi (6) (1=1,---,m). 


The functions f;(u,v, p,6) are continuous in their arguments. This 
fact is most readily established by first showing that the Green’s function G 
set up for B(6) with pole at (uo, vo) is continuous on the product space of 
points (u,v, Uo, Vo, b) for (u,v) on B and (u,v) ~ (Uo, vo). From this it 
follows by an appropriate local use of the Poisson integral that a directional 
derivative of G on B is a continuous function of the direction cosines involved 
and of the above arguments of G, provided (u,v) ~ (uo, vo). Green’s formula 
for f; then shows that f; depends continuously on its arguments. 

We shall show that the area Q(p,b) of S(p,b) is continuous on the 
space of points (p,b). Let C(b) be one of the boundary circles of B(b). 
Let (po, 6.) be a particular point (p,b). Let evaluation when b = by be 
indicated by adding the subscript 0. Let C’ be a circle on Bo, concentric with 
C, and so near C’, that Cy and C’ bound a plane ring region Ry on By. We 
shall not vary C’ with b. We shall vary b continuously among points so near 
b, that C’ is bounded away from the boundary of B(b). Let R(b) be the 
region on B(b) bounded by C(b) and C’. 

Recall that @ is the radius of C(b). This radius a is one of the codrdi- 
nates of (b). Let (7,4) be polar codrdinates belonging to C(b). Suppose 
for simplicity that R(b) is within C(b). Set 


9 


(a* —r*) 
a? — 2ar cos (@— a) + 7? 
(r<a; 


1 27 
(4.1) 


J 0 


On R(b) set 


(4. 2) fi(r cos 6 + a, rsin 6+ @, p,b) = 4, p, b). 
The difference 
(4. 3) — hi = oi (7, 6, b) 


is harmonic on R(b) and, as r tends to a, tends to 0 uniformly with respect 
to 6. Hence w; may be harmonically continued over the ring region consisting 
of R(b) and its reflection in C(b). This continuation will again be denoted 
by Wi. 

We wish to show that the area of the surface S(p, 6) varies continuously 
with (p,b). If H is a subregion of B(b), we shall refer to the area of that 
part of S(p,b) on which (u,v) ranges over H as the area of S(p,6) over H. 
To establish the continuity of Q(p, 6), it will be sufficient to prove the con- 
tinuity of the area of S(p,b) over regions such as R(b) neighboring the 
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respective circular boundaries of B(b). For the area of S(p,b) over the 
residue of B(b) depends continuously on (p,b) in an obvious manner. 
We find it convenient to set 


ai 

| Or or 
Aij(U,V) = |0U; OV; 
| 


as a matter of formal notation. To show that the area of S(p,b) over R(b) 
varies continuously with (p,6), it will be sufficient, as in § 3, to prove the 
following lemma. 


LemMMA 4.1. Corresponding to an arbitrary positive constant e, 


27 p 
(4. 4) f f | Ais drd?é e (a—n<o<p<ay), 
0 o 
provided (p,b) ranges on a sufficiently small neighborhood of (po, bo), and y 


is a sufficiently small positive constant. 


In this lemma a is a coordinate of b and varies with Db. 
Recalling that ¢; = hi + we see that 


(4.5) =Aij(A,h) + Aij(h,o) + Aij(o,h) + Aj; (0, 0). 


Our lemma would hold were ¢ replaced by h, by virtue of Lemma 3. 2. It also 
holds if ¢ is replaced by w, since the partial derivatives of w as to r and 6 are 
bounded for (u,v) on R(b) and (p,b) on a sufficiently small neighborhood 
of (po, b.). Hence it will be sufficient to show that 


2r 
(4. 6) f {| + | Acs (w, b) < e, 
0 o 
subject to the conditions of the lemma. This in turn will follow provided we 
establish the following lemma. 
Lemma 4.2. Corresponding to an arbitrary positive constant eé,, 


Ohi Ow 


and 


drdé < (a—n<o<p<a), 


provided (p,b) ranges over a sufficiently small neighborhood of (po, 60) and 
n is a sufficiently small positive constant. 
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For (p,6) on a sufficiently small neighborhood N of (po, bo) and for 
(u,v) on R(b), the quantities 


admit a bound K. As in (3.9) we find 


(a? — r*) 3 
06 Jo —2arcos (@—a) +r pi(a). 


Hence the integral in (4.7) is at most 


(a? — r?) 2r p 
a’ — 2ar cos a) +7? (a) drdv;(a). 


Hence (4.7%) holds subject to the conditions of the lemma. 
Let the integral in (4.8) be denoted by J. As in the proof of (3.10), 
we find that 


or 


Ohi a sin (6 — a) 
— 2ar cos (@—«) +r dpi(a), 


so that for (p,b) on N and (u,v) on R(b), 


Ka ¢ | sin (@— 
| 
Is — 2ar | cos (@—a)| + 7? drdédv;i (a). 


Referring to formula 67 of Pierce’s tables, we see that 


(4.9) Wd0dv, (a), 


where 
p 
W=— | arctan {8 a) | 


) alsin (@—a)| 


and where the arctangent may be taken between —-2/2 and 7/2 inclusive. 
Since W has the period x in 6, (4.9) may be written in the form 


9 Qr a+m/2 
(4. 10) J< = WdOdv; (a). 
7 0 


T e 


Recall that W = 7, and let 8 be an arbitrarily small positive constant 
<2/2. From (4. 10) we infer that 


(4. 11) 2Ka f (a) + = Wdédv; (a), 
0 70 


where (8) is complementary to the interval («— 8,a-++ 8) on the interval 
(a —2/2,a-+ 2/2). Let p be restricted to so small a neighborhood of po 


| Ou; | | 00; | 
| Or |? | 00 |? 
e 
|, 
| 
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that v; (27) admits a bound L, independent of p. Let » > 0 be so small that 
when @ is on J(8) and a—yn<p<o<a, W is at most 8. Then the sum 
in (4.11) is at most 

4KaLé + 4KasL = J. 


Since 6 is arbitrarily small, the integral J is arbitrarily small, subject to the 
conditions of the lemma. 

The proof of the lemma is complete. 

Lemma 4.1 follows from Lemma 4.2 as we have already pointed out. 
From Lemma 4.1 the continuity of the area of S(p,b) over R(b) may be 
inferred essentially as in § 3. The continuity of the area O(p, 6) then follows 
as indicated prior to Lemma 4.1. Hence we have proved the following 
theorem. 


THEOREM 4.1. The area Q(p,b) of the harmonic surface S(p, b) 
assuming the boundary curves of the set p on the boundary circles determined 
by the set b varies continuously with the set (p, b). 
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THE NON-LINEAR BOUNDARY VALUE PROBLEM OF 
THE BUCKLED PLATE.* 


By K. O. Friepricus and J. J. STOKER. 


Introduction. In this paper we develop methods for solving a non-linear 
boundary value problem which has its origin in a problem in elasticity. The 
methods yield (Part I) a complete numerical solution of the problem; we 
obtain also (Part II) the relevant uniqueness, existence, and convergence 
theorems. Although we treat a specific problem, the basic principles of our 
methods could be applied to a considerable variety of non-linear problems. 

Our problem concerns the buckling of a thin elastic plate under forces 
acting in the plane of the plate. While it is true that the lowest “ critical ” 
load at which buckling begins can be determined by solving a linear eigenvalue 
problem, the treatment of the buckling for loads beyond. the critical load 
requires of necessity the description of the situation by a boundary value 
problem for non-linear differential equations. 

Non-linear differential equations for the case of the thin plate have been 
derived by v. Karman, cf. [13]. They contain the linear biharmonic differ- 
ential operator (of order four) and quadratic terms in the second derivatives. 

We confine our discussion here to the special case of the circular plate 
under compressive forces at the boundary with radial symmetry assumed. In 
this case the v. Karman equations reduce to a pair of ordinary non-linear 
differential equations, each of the second order. The boundary value problem 
considered in this paper concerns the latter pair of equations. It is found 
that the problem depends essentially upon one parameter N—the ratio of the 
pressure » applied at the edge to the lowest critical pressure p° at which 
buckling just begins. 

In Part I we explain a procedure for complete numerical solution of our 
problem. We have carried out the numerical solution and will report here on 
some of the results.2 Our procedure yields solutions for the entire range 


* Received August 31, 1940. 

1For general theories of non-linear elasticity see Trefftz [21], Biot [1], and 
Murnaghan [16]. The theory of Biot, which assumes small strains, appears to include 
that of v. Karm4n as a special case; this theory has been applied to the buckling of 
thick plates. Murnaghan does not even make the assumption of small strains, and is 
successful in applying his theory to two special problems involving very large strains. 

2 A complete numerical and graphical discussion of these results, particularly those 
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0= N= ~; to cover this range we employ three methods which are different 
but which interlock. Each of the three methods is suitable for a particular 
range of values of NV: 1. perturbation method (sec. 2) for low values of N, 
2. power series method (sec. 3) for intermediate N, 3. asymptotic solution 
(sec. 4) for N— o. 

The perturbation method consists in developing the essential quantities 
with respect to a parameter and solving the sequence of linear boundary value 
problems which arise from the vy. Karman equations. The first such linear 
problem is nothing but the above mentioned eigenvalue problem, the lowest 
eigenvalue of which yields the critical load where bifurcation begins. This 
method is used for a rather low range of the ratio N, say 1S NS2.5; 
beyond this range the amount of numerical computation involved is excessive. 

The power series method consists in developing the essential quantities 
with respect to powers of the distance r from the center and satisfying the 
boundary conditions by solving a transcendental equation. Certain peculiar 
difficulties encountered in solving the latter equation can be overcome easily 
by using results furnished by the perturbation method. In this way one can 
obtain solutions for a higher range of values of N, say N = 15, but hardly 
for larger values of N since the labor of calculating again becomes excessive. 

The asymptotic solution (sec. 4) gives the limit situation for N > o. 
It is also characterized by a non-linear boundary value problem. The formu- 
lation of this problem is based on the occurrence of a boundary layer effect 
which in our computations had already become apparent for N < 15. The 
boundary layer effect can be roughly described as follows: While with increas- 
ing N all quantities tend to become constant in the interior of the plate, they 
change rapidly in a narrow strip at the boundary.* Once the asymptotic 
solution has been found it is possible by a perturbation method to develop 
the solution in the neighborhood of VN = o. In sec. 4 we carry out the first 
step in such a development. 

In Part I (sec. 1 to 4) of this paper our problem is treated from the 
point of view of applied mathematics. In Part II, which is quite independent * 
of Part I, we investigate our problem with regard to existence and uniqueness 
of the solutions and their continuous dependence on N for N =o. The 


of practical significance, will appear, cf. [6]. A short note discussing our results has 
appeared, cf. [5]. 

* Such an edge effect presents some analogy to Prandtl’s boundary layer phenomena 
encountered in connection with the flow of viscous fluids around obstacles (cf. e.g. 
[19]). This analogy aided us materially in finding the proper mode of attack for the 
asymptotic solution. 

* However, some of the assumptions made in Part I receive justification in Part II. 
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discussion is based on a minimum problem and its relation to the boundary 
value problem.°® 

In sec. 5 these problems are completely formulated for the case of finite 
N. In sec. 7 the same problems are formulated in terms of new variables in 
such a way as to make possible simultaneous treatment of the finite (V < 0) 
and asymptotic (NV =o) cases. We prove in sec. 8 that the minimum 
problem has at most one solution, apart from sign, and that such a solution 
never vanishes; in addition we show that a solution of the boundary value 
problem which never vanishes solves the minimum problem. To prove the 
latter statement we show that the solution of our non-linear minimum problem, 
which is of the fourth degree, is at the same time the solution of a certain 
quadratic minimum problem to which Jacobi’s transformation of the second 
variation can be applied. The existence of the solutions is treated in sec. 9 
by direct methods; in addition, we prove the continuous dependence of the 
solution on N for N = «, that is, we show that the solutions for finite NV 
converge to the asymptotic solution as N— o. This asymptotic treatment, 
however, refers only to the boundary layer effect. In sec. 10 we give a rigorous 
treatment of the limit state in the interior of the plate and its connection 
with the boundary layer. 

While it is true that the minimum problem has only one solution, apart 
from sign, the boundary value problem will have more and more solutions as 
N increases. We show, however, in sec. 6 that the boundary value problem 
has at most three solutions, one identically zero, the others differing only in 
sign, provided that NV is not too large. The method used combines well-known 
facts concerning linear eigenvalue problems with geometrical reasoning and 
could be applied to a more general class of non-linear problems. 

Appendix I is devoted to an investigation of the onset of buckling from 
the point of view of E. Schmidt’s bifurcation theory (cf. e. g. [14]) ; this leads 
to a justification of some assumptions made in working with the perturbation 
method. 

PART I. 


1. Formulation of the problem. We first introduce the v. Karman 
equations (cf. 13), reference to which has been made in the introduction. 
They are the following pair of non-linear differential equations for two 
functions ¢ and w of the variable x and y: 

(1. 02) (yh)? V iw + (dyyWoe — + = 0. 


5 For a different class of non-linear boundary value problems and their treatment 
by minimum problems, see Hammerstein [10]. 
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The quantity w is the deflection of the middle surface of the plate and ¢ is 
the Airy stress function, from which the stresses in the middle surface (the 


“membrane ” stresses) are derived from the relations 


(1. 03) = Cry = — $ry, Tyy = ez; 


implying that the equations of equilibrium are identically satisfied. Subscripts 
x, y, and r on all quantities but stresses o and strains e« denote differentiation 
with respect to these variables. (What we call stresses here-are to be under- 
stood as stresses per modulus of elasticity #). The stresses in the middle 
surface are also the averages of the stresses over the thickness of the plate. 
As a consequence of the hypothesis that the squares of the slopes of the middle 
surface are of the same order of magnitude as the strains in.the middle sur- 
face, the following non-linear relations hold between the strains e, stresses o, 
and displacements u, v, w of the middle surface 


Ur —=— (1 + v)ore + v(ore + oy), 
(1.04) =F (Uy + V2) + = — (1+ v) ory, 
= Vy + y= — (1 + Vv) + + Cyy)- 


The expressions for the strains here differ by the quadratic terms in wz, wy 
from the expressions of the linear theory. The quantity v is Poisson’s ratio. 
Equation (1.01), the compatibility equation, is obtained from the second 
and third members of (1.04) as the result of eliminating wu and vw. Equation 
(1.02) is the same as that obtained from the usual linear bending theory of 
plates; in (1.02) h is the thickness of the plate and y? = 1/12(4— *. 

We shall take as boundary conditions on ¢ and w those corresponding to 
the following physical situation: Only one external force is applied—a uniform 
normal pressure at the edge and in the plane of the plate. The plate is simply 
supported, i.e., the deflection at the edge is zero and no bending moment is 
applied there. Our methods can, however, be extended with obvious modifica- 
tions to other cases, e. g., the case of the clamped plate. 

The essential restriction of this paper is that we consider only the circular 
plate with radial symmetry. As domain of the variables 2 and y we take 
therefore the circle r? = 2? + y*? S R* and assume that ¢ and w depend upon 
r alone; in this case (1.01) and (1.02) become ordinary differential equations. 
We introduce the new functions p—1"¢,, =— Fr and the linear 


differential operator 


(1. 05) G = — — 


The quantity p represents physically the membrane stress on an element 
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perpendicular to the radius. The differential equations (1.01) and (1. 02) 
become 


d 

+ pq] = 9, 
where 7 = yi/F; integration of these equations yields 
(1. 06) Gp — = ¢,/?’, (1. 07) Gq + gp = 62/1". 


Unless the constants c, and c. are zero, the original functions ¢ and w could 
not have continuous fourth derivatives, since continuous fourth derivatives of 
¢ and w at r =O imply, on account of radial symmetry, continuous second 
derivatives of p and qg with respect to r and hence continuity of the left 
members of (1.06) and (1.07) at r=0. We therefore assume c, = c, = 0 
and take as the fundamental differential equations® for our investigation 


(1. 08) Gp = 49’, (1. 09) n° Gq + pq = 0. 
As boundary conditions at the center we take 


dp dq 
(1. 10) 0, (1.11) 0 for r=0 


which must be satisfied, again because of symmetry, if @ and w are to possess 
continuous fourth derivatives. The physical situation which we consider leads 


to the following conditions at the boundary: 

(1. 12) p=p for r=—R, 

where j is a prescribed positive constant; 

(1. 13) (Rk) (Rk) + A+ v)q(k) =0 (r=f), 


which implies vanishing of the bending moment at the edge. 

The equations (1.08) to (1.13) constitute the formulation of the 
boundary value problem to be discussed in what follows. 

Once the functions p and q have been determined, all quantities of 
interest from the physical point of view are easily obtained. In particular, 
the most important stresses, the circumferential membrane stress pe and the 
radial bending stress p, at the lower surface of the plate are given by 


(1. 15) De rpr + p=Bop, and (1.16) po = 


where the operator By is defined in (1.13). 
Our problem would seem to depend upon several parameters; but only | 


® These equations are simpler in form than those used by others, for example Nadai 
[17, p. 288] and Way [22], who work with quantities other than p and q. 
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two are really essential, namely, Poisson’s ratio v and what might be called 
the “thrust ratio” 


(1.17) 
This becomes evident upon introduction of the quantities 


into equations (1.08) to (1.14) as well as (1.05), which then assume the form 


aR 05)* G d 


(1.08)* and (1.09)* Gp* = $q*?, Gq* + p*q* = 0, 
(1.10)* and (1.11)* for 
(1.12)* and (1.13)* p* = d?, Byg* =0 for r—1. 


(Note that the parameters A and v occur only in the boundary conditions). 
It follows that two plates with the same values of A and v possess solutions 
differing only by constant multipliers. 

The material constant v may have any value between zero and one; 
usually it is about 0.3. In general, variations in v affect the solutions of 
problems in elasticity very little. We therefore took a convenient fixed value 
(0.318) for y in our numerical calculations. Hence only A remains as the 


essential parameter (cf. *). 


2. Perturbation method. Our boundary value problem has the obvious 
solution g =0, p=const. This is also the only solution for sufficiently 
small values of the parameter A—a fact which we prove in sec. 6 (cf. Theorem 
6.1). At a certain value A, of A there will be a bifurcation; for A > A» two 
solutions for which g #0 will exist; these solutions differ only in the sign 
of g. The onset of such a “ buckling” is usually treated by linearizing the 
v. Karman equations (i. e., by setting the right member of (1.01), or, for us, 
of (1.08) equal to zero) ; the result is an eigenvalue problem for which the 
lowest eigenvalue is Ao. For information respecting what occurs for A > Ao, 
it is necessary to solve the non-linear equations. Our problem can be solved 
for A beyond A» by the perturbation method. One finds that the above lineariza- 
tion is nothing but the result of the first step of this method. 

The perturbation method consists in developing p, p, and q with respect 
to a parameter «. Since solutions (p,q) must appear in pairs (p,q), (p,—q), 
it is natural to assume developments of the following type:? 


7 That there exist solutions given by expansions of this type can be shown by the 
methods of the bifurcation theory of E. Schmidt (cf. App. I). 
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p= p+ ep? + Ap) 
p(r) = p'(r) + ep (r) + (r) 
g(r) = eq? (r) + (r) + g(r) 


From j(e) we then obtain ¢ as a function of p. 

We substitute the above expansions into the differential equations (1. 08) 
and (1.09) and the corresponding boundary conditions. By collecting terms 
of the same order in e we obtain a sequence of linear differential equations for 
p?,q™, accompanied by boundary conditions. For p° we find 


the only solution of is =const., i.e., =p. The value of p® is, 
however, not yet determined. The next equations are 


+ Brg™(R)=0, gr (0) = 0. 


This homogeneous differential equation with homogeneous boundary condi- 
tions will have as sole solution g“) =0 except when p® is an eigenvalue. 
The system serves as an eigenvalue problem for fixing the value of p® = p®; 
it is the same problem as would result from the customary linear treatment 
of stability, in accordance with the above remark. The lowest eigenvalue 


(2.1) p° = 


characterizes the thrust at which buckling begins and the solution bifurcates. 
We are now in a position to introduce the parameter NV by 
N will be used frequently as the essential parameter (cf. end of sec. 1) in 
place of 
The solution g) of the differential equation in ZL, satisfying g-) (0) =0 
is easily found in terms of the Bessel function of order one: 


(2. 3) = 


where a is a constant which we may choose arbitrarily. The lowest value of 
for which the boundary condition Byg") = 0 is satisfied, is 


(2. 4) Ao = 2.06, when v —.318.° 


In this manner the lowest critical value A,* = 4.2436 of the thrust ratio is 
determined. 


8 This value of vy is a convenient and reasonable one; we have used it throughout 
our numerical calculations (ef. end of sec. 1). 


| 
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The equations for p‘) are 
Is: Gp =4(q™)?, =p, (0) — 0. 
Using (1.08) and (1.10) we obtain 


By a second integration one obtains p‘*), except that p‘?’ has not yet been 
determined. It will, however, be fixed in the next step. 


The problem for q‘* is: 
Gq? + pq = — pq, Byq® (R) = 0, qr(0) 0. 


The differential equation here is non-homogeneous, but the corresponding 
homogeneous problem, which is the same as JZ, has a solution not identically 
zero; thus Lz presents the exceptional case in which the non-homogeneous 
problem need not have a solution and, if it has, the solution is not unique. 
In order that LZ; possess a solution, the right-hand side — p‘*)q®) must be 
orthogonal to the solution gq‘? of the homogeneous problem. That is p‘*? 
must satisfy the orthogonality condition 


R 
p’?)(q®)*rdr = 0. 
0 


This relation serves to determine the boundary value p‘). 

Having satisfied the preceding condition we are sure that LZ; has a solu- 
tion. Let ¢‘) be one suclt solution; it follows that all functions of the form 
g®) =q® + aq are also solutions. The constant @ is, in principle, 
undetermined.’° We may choose « arbitrarily, but we wish to choose it in 
such a way as to obtain a good approximation to the complete solution. 


® While this integral is expressible in terms of Bessel functions: 


> 

dr 
p™ itself, unfortunately, appears not to be expressible in terms of known functions. 

10 This does not mean that the final solution is not uniquely determined, but refers 
to the fact that the parameter e can be chosen in different ways. Instead of e one might 
just as well have taken for parameter any function of ¢ of the form 

The development 

is then equivalent to 


q=eq? + fag?) 


— 
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The Rayleigh-Ritz method is a natural procedure for this purpose. By 
inserting g = + a*)q) + into the strain energy functional?! V(q) 
for fixed p we obtain a function of « and «. Since q can be characterized 
(cf. sec. 5) as the function minimizing V(q), a suitable relation between « 
and ¢« can be obtained by minimizing V considered as a function of a. The 
resulting condition for a, however, is a rather complicated cubic equation with 
coefficients depending on e; instead of solving this equation, we develop it in 
powers of « and retain the term, of lowest order, which incidentally is linear 
in a We choose for a the value obtained from this linear equation.*” 

Having fixed the constant «, and thus q‘*), we proceed to determine p‘*) 


and p‘*) from the equations: 
be: Gp = gq), p' (R) pr* (0) = (), 
Ls: Gq + pg’) — — Byq? (R) ==(), (0) = (), 
Here we face the same problem as that encountered above: the boundary 
value p‘*) must be determined from the condition that Z; have a solution, 
while g‘*) is determined only within a multiple of qg@), which multiple may 
be chosen arbitrarily, or fixed by a Rayleigh-Ritz procedure as above. 

The general type of equations to be solved is: 
Ln: == f(s) (7), (nm even), 
where the expressions f and f‘"*) are quadratic in previously determined 
functions q‘) and with i<nand j= n. Equations of the type LZ, can 
always be solved by two quadratures, which is obviously not the case with 


11 The functional V is defined and discussed in sec, 5. 
12 The linear equation is 


R R 
[2 Pr??? dr + ) 
0 0 
R-*p (q®))*redr 
0 


R R R 
—[6 f Pr) 9,18) — (2) rdr = 0, 
0 70 0 
where 
and 


We used this equation in our calculations with results which seemed to justify the 
labor involved in such a procedure. 


848 K. 0. FRIEDRICHS AND J. J. STOKER. 


Ins1. However, Im,, can be transformed by introducing the new dependent 
variable wu‘) = q(™*")/qg™) and it then takes the form 


a 
dr 


The first integration yields 


(n) r 
dr 0 ; 


A second integration determines u‘”) and consequently q‘"*!). The boundary 
condition Byg‘"*)(R) —0 is transformed into u,“ —0 for r=R, i.e., 


R 
fl dr = 0, and this relation holds automatically since it expresses 
0 


the condition that Dn,, have a solution satisfying the boundary conditions. 
We carried the solutions so far as to calculate q‘* and p"®. 

Since all quadratures, after the first, appear not to be expressible in 
terms of known functions, we found it necessary to operate with power series. 
Although the integration of power series presents no difficulty, the obvious 
necessity for multiplication and division of one power series by another makes 
the numerical calculations very laborious.** 

The rapidity of convergence of perturbation series is a matter of general 
significance, in view of the wide use of such methods in problems similar to 
ours [cf. 9, 15, 18].1* In our problem it is possible to check the accuracy of 
the perturbation method by comparison with the results of a different method 
(cf. sec. 3). Our calculations show that the perturbation series converge satis- 
factorily only for a rather small range of values of the ratio N = p/p® = d?/d)’. 
If errors up to about 4% are permitted, g may be calculated for N = p/p°® 
= 1.15, 1.8, 2.5 by using terms up to and including those of first, third, 
and fifth order respectively; p may be calculated for N = 1. 4, 2.2, 2.8 by 
using terms up to and including those of second, fourth, and sixth order 
respectively. For the stresses p, and pe (cf. (1.16) and (1.15)) the con- 
vergence is not quite so good. Beyond these values of N the approximations 
become inaccurate rather quickly. 


18 We retained eight terms of each power series and found this just accurate enough 
to compute q® and p™, with which our calculations ended. 

14 Marguerre [15], for instance, mentions that his calculations for the rectangular 
plate, which implied two perturbations, may perhaps be valid up to N = 20. In our 
simpler case we find that values of N of such magnitudes cannot be treated with three 
perturbations. In fact, such values of NV lead to solutions already in the asymptotic 


range (cf. sec, 4). 
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3. Power series method. The solutions of equations (1. 08) and (1. 09) 
can be obtained as power series in r. All coefficients of the series could be 
fixed in terms of the first coefficient in each series, the latter being determined 
by the boundary conditions. This would involve solving very complicated 
transcendental equations. It is possible, however, to transform the problem in 
such a way that only one transcendental equation of relatively simple type 
need be solved.*® 

We introduce the new independent variable 

AR-r, A; 
and the new functions 


(3. 1) x == A“q/y, (cf. (1.18) 


where A is a parameter at our disposal. The differential equations (1. 08) 


and (1.09) assume the form 


3.2 
with boundary conditions 
lor | ° dk | 

da da | 
(3. 4) Bu =0 for a=—A, 
(3. 5) a(A) 


Instead of prescribing + and A we proceed as follows: 7(0) and «(0) are 
chosen, A is determined by solving (3.4), and z is calculated from (3.5). 
The power series for w and x contain only even powers of a, a conclusion 
that can be drawn from (3.2) and (3.3) ; we write, therefore, 
x as 
oL 
r= = > 
k-0 k-0 
Upon insertion of these power series into (3.2) we obtain the following 


recursion formulas for 7; and xx: 


2k (2k + 2)m, 2h (2k + 
m+n=k-1 m+n=k-1 


18 The power series method (with a different arrangement) was used by Hencky 
[11] and Way [22] for the problem of the bending of circular plates under lateral 


loads or edge moments. 


— gg? — = OK", 
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After having chosen +(0) =, and x(0) =x, the successive coefficients are 
calculated by these formulas. Equation (3.4) becomes 


(3. 6) > (2k + 1+ v) 0, 
k=0 


and must be solved for its lowest root *® «A; p* and q* are then calculated 
from (3.1). 

By this method we obtained solutions for-a much higher range of values 
of N with very much less labor in numerical computation than would be 
required by the perturbation method. However, it was necessary to calculate 
a rather large number of coefficients in order to obtain sufficient accuracy for 
the higher values of N, e. g., for VN = 14. 7, the most extreme case calculated, 
we found thirty terms in each series barely sufficient. 

In applying the power series method for the higher values of N, the 
following considerations are essential. We wish to obtain a fairly even distri- 
bution for the values of NV and at the same time we want to know roughly 
where the solutions « = A of equations (3.6) will be. The amount of labor 
in computation reduces to a minimum if A~1: for then we have a means 
of knowing the accuracy with which the coefficients need be calculated and 
also the number of terms to be computed in the series. In order to obtain 
A~ 1 and a pre-determined distribution of the values of N, it is necessary 
to make in advance fairly accurate estimates of the values of p* = p/y? and 
q* =q/n at the center (r =0). How to do this is not obvious, for the values 
of p and q at the center change with increasing N in rather surprising and 
unforeseen ways, as our computations showed. This is one disadvantage of 
the power series method which, however, can be overcome by beginning with 
the perturbation method (which requires no estimates in advance), and pur- 
suing the latter until the trend of the solutions with increasing N becomes 
apparent. The solutions obtained above by the perturbation method for 
N = 2.5 proved in fact to be amply sufficient for this purpose, although not 
all of the distinctive qualitative features of the solutions, in their dependence 
upon NV, had yet appeared in this range. 

In comparing the perturbation and power series methods, it should be 
mentioned that the former is applicable in principle to similar problems 
without radial symmetry; this appears not to be the case with the power 


series method. 


4. Asymptotic solution. Although the power series method can be 


16 Tf we were to take the second root of (3.6) for A, we would find the prolongation 
into the non-linear range of the second eigenfunction of the linearized problem L, (sec. 2). 
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applied to solve our problem for rather high values of N, it will not serve to 
determine what occurs when N tends to infinity, or, what is the same thing, 
when the thrust ratio A* = p/y* tends to infinity. Such a passage to the 
limit may be achieved physically in various ways which are mathematically 
equivalent: for example, one might take a fixed plate (7 fixed) and allow p 
to increase indefinitely, or hold p fixed and consider plates with slenderness 
ratios » tending to zero. 

In order to determine the asymptotic behavior of the solutions, it is 
necessary to formulate a limit boundary value problem by a passage to the 
limit in the original differential equations and boundary conditions. A simple 
and rather natural procedure would be to hold p fixed and let y tend to 0 in 
equations (1.08) and (1.09) which then take the form 


(4. 01) Gp'= 3¢’, pg = 0. 


The only solution of these equations which satisfies the regularity conditions 
(1.10) and (1.11) is g==0, p=constant. One is then tempted to fix this 
constant by setting it equal to the prescribed value p at the edge. This means 
that in the limit there would be a hydrostatic compression (p > 0) throughout 
the plate. Such a procedure corresponds to the treatment of laterally loaded 
clamped sheets by Hencky [11, 12], cf. also [4,2], and a similar method may 
well be legitimate in cases where no edge compression is prescribed. Applied 
to our case, however, wrong results would be obtained. The correct limit pro- 
cedure can be found only by a deeper analysis of the nature of the solutions. 

In our case we have found by numerical calculation (cf. [5] and [6]) 
that with increasing thrust ratio the membrane stress p approaches a state of 
constant tension (p <0) over an increasingly large part of the interior of 
the plate and that the transition from tension in the interior to the prescribed 
compression » > 0 at the edge takes place in a narrow strip, the breadth of 
which decreases with increasing A. (We shall use the parameter A instead of 
N from now on). 

These results of the numerical calculation indicate strongly the nature 
of the limit situation. In the interior of the plate, the above solution q=0, 
p= const. of (4.01) seems valid, but the constant should not be determined 
by setting it equal to p. The constant can be fixed only by an investigation of 
the transition phenomena in the “boundary layer.” The boundary layer phe- 
nomena are coupled, with the fact that the order of the system of differential 
equations (1.08) and (1.09) has been reduced from four to two on passing to 
(4.01). It is not to be expected that the solutions of the limit system (4. 01) 
can satisfy four boundary conditions. The above discussion indicates that the 
lost boundary conditions are those at the edge. 


e 

» 
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A treatment of such an edge effect requires that the scale be stretched 
with increasing A in such a manner that the width of the edge strip, or 
boundary layer (as measured in the new scale): does not shrink to zero. This 
will be accomplished, as we shall see, by introducing the new independent 


variable 

(4. 02) 0=B=A, 
where 

(4. 03) A= p/n =AVN. 


Subscripts 8 in what follows denote differentiation with respect to 8. Upon 
introduction of 

(4. 04) P = p/j, Q =nq/p 

as new dependent variables, the original differential equations (1.08) and 


(1.09) take the forms: 
Ppp — [38/(A— B) Pp = Q?/2 


— [3/(A— 8) + PQ =0. 

The boundary conditions (1.12) and (1.13) become 

(4. 06) P(0)=1, Qe(0) —[(1 +») /A]Q(0) —0, 
while the regularity conditions (1.10) and (1.11) take the form 
(4. 07) Ps(A) = Qa(A) = 0. 


We now let A tend to infinity and obtain the limit differential equations 
(4. 08) Ppp = Q?/2 
(4. 09) + PQ =0; 
the boundary conditions (4.06) become 
(4.10) Qg(0) =0, (4.11) P(0)—1, (4.12) =0. 
The equations (4.08) to (4.12) constitute the formulation of the limit 
boundary value problem concerned with the “ boundary layer.” (This is not 
to be confused with the limit problem partially formulated in (4.01), which 
is concerned with the interior of the plate.) The present problem has the 
trivial. solution P = 1, Y = 0; if it has another solution P, Q 50, then also 


P,—Q isa solution. Hence the sign of is arbitrary. 
The equations (4.08) and (4.09) possess the first integral : 


(4, 13) Qp? — Pg? + PQ? = const. 


| 
| 
| 
| 
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It is plausible to take zero as the value of the constant. This, in view of the 
boundary conditions (4.12), is equivalent to assuming PQ? —0 at B= oo. 
(For a justification of this assumption see **). We take, therefore, 


(4. 14) Og? — Ps? + PQ? =0. 


In view of (4.10) and (4.11) we find Ps? = Q? for B=0, or Pp= + Q. 
Since we may choose either sign for Q, we set 


(4. 15) Pa(0) =— Q(0). 


We introduce new variables z, y, z (not to be confused with the space 
variables used earlier) as follows: 


(4. 16) == 0=27 6, 
(4.17) y =—o’P, 
(4. 18) z= 


where € and » are numbers to be determined. The differential equations 
(4.08) and (4.09) in these variables are: 
(4. 19) +2 =0, 
(4. 20) yz = 9. 

The introduction of the new variable x has the effect that the resulting 
differential equations (4.19) and (4.20) possess solutions expressible as 


power series in 


ne 


(4. 21) y => (—1)*yx* 
k=0 

(4. 22) > 
m=0 


Upon substituting (4.21) and (4.22) into (4.19) and (4.20) we find 
the following formulas for y; and Zm: 


(4. 23) (2h) = ZmZny 
m+n=k-1 
(4. 24) (2m +1)?2zn—= 
n+k=m 


There is only one arbitrary coefficient, namely z, to which we assigned the 
numerical value z= 4. The coefficient yo is determined from (4.24) for 
m==() and has obviously the value yo 1. This fact makes it possible to 
re-write (4. 24) as a proper recursion formula: 


ed 
or 
Lis 
| 
| 
| 
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m-1 


(4. 25) 2m (2m a 2) Zn = > > ZnYm-n- 


n=0 


We found it amply sufficient to compute ten terms in each series. 

We turn now to consideration of the boundary condition associated with 
(4.19) and (4.20). The regularity conditions (4.12) are satisfied auto- 
matically in view of (4.16), (4.17), (4.18), and the assumed development 
into the power series (4.21) and (4.22). The boundary conditions (4. 10) 
and (4.11) become in the new variables 


(4. 26) zo(é) =0, (4.27) 


The first is a transcendental equation in €, to be solved for its lowest root, 
which is found to be € = .98618.1" This value inserted in (4.27) determines 
», Which is found to be wo = .68754. 

Once € and w are determined, the limit boundary value problem is solved 
in principle. The function P(f) starts with the prescribed value P(0) = 1, 
decreases monotonically, assumes the value zero at B = .941, and approaches 


the value P( oo) =—w* = — .47271 as B— o, the latter value resulting 
from (4.17). The function Q(8) decreases monotonically and approaches zero 
as B tends to For Q@(0) and P(0) we find Q(0) = — P,(0) = 1.61436, 


thus checking (4.15). 

We can now discuss the connection ** between our results from the 
boundary layer theory and the limit procedure for the interior of the plate 
described above. The inner edge of the boundary layer is to be identified in 
the limit with the outer edge (r—> /) of the interior region. Since the value 
P(«) =—vo? is the limit value of p/p at the inner edge of the boundary 
layer, —w*p is the proper value to be taken for fixing the essential constant 
for the limit problem in the interior of the plate.t® Thus we see that the 
limit membrane stress p in the interior is a tension. The value Q(o) =0 
is the limit value of g/p at the inner edge of the boundary layer and this 
result is also consistent with the solution gq = 0 of the equations (4.01). 

This solution of the “asymptotic” boundary value problem furnishes 
limit values for all quantities. From the piysical standpoint it is of especial 
interest to discuss limit values of those quantities which give information on 
the ultimate stress distribution. The value Q(0) is the limit value of »@/p 


17 The reason for assuming 2, == 4 was to make &- 

18 A rigorous proof of the validity of the limit process tor the interior of the plate 
is given in sec. 10. 

1° This procedure differs from that in Prandtl’s boundary layer theory: there the 
limit state in the interior furnishes a quantity which must be used to determine the 
solution of the boundary layer equations. 


| 
| 
| 
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as one sees from the definition (4.04) of Q. The value Pg(0) is the limit 
value of npc/p*/? = pc/pA, where po=Rp,-+p (cf. 1.15) is the circum- 
ferential membrane stress at the edge of the plate; this follows for A— 
from the identity 

(4. 28) = — (1— B/A) Pp + 

where P is defined in (4.04). For the radial bending stress py (cf. 1.16) we 
obtain from (4.04) the formula 


6y[ (1 — B/A) Qs — (1 +») 
The stress p, attains its maximum at the point where £ satisfies 
(1 — B/A) — (2 + v)Q/A = 0. 


In the limit (A— o) this equation reduces to Qgg =0; in view of (4.09) 
it is satisfied at the point 8B = .941 where P(B) =0. It follows that in the 
limit the point of maximum bending stress lies in the boundary layer. As a 
result, we find from (1.28) that the limiting value of p,_.,/pA 1s 
— 6yQp(.941) = 1.1123. 


(4. 29) 


Asymptotic development. We proceed to explain a method of develop- 
ing our solutions P and Q in the neighborhood of A= «. This leads to 
approximate formulas for rather large values of A. Although we have not 
proved the validity of such a development, our numerical results indicate 
strongly that it is justified. 

We confine ourselves to the first step of such a development; it could be 
carried further. We assume that P(8) and Q(f) considered as functions of 


x =" possess first derivatives with respect to x at x = 0, namely 

(4.30)  8P(B) P(B)|™, —-8Q(8) = — (8) |™. 
Ox OK 


Differentiation of equations (4.05) and (4.06) with respect to x for x0 
leads to the linear differential equations 


(4. 31) — = 

(4. 32) + P8Q + = 

for 8P and 8Q, and the boundary conditions 

(4, 33') §P(0) =0, 

(4. 34) = (1+ where Qo=—Q(0). 


Differentiation of (4.07) with respect to x yields 


8Pg(x-?) —x?Ppp(x?) =0 and 8Qg(«*) = 0. 


ith 
to- 
mnt 
0) 
ot, 
ed 
les 
1g 
ro 
6, 
1e 
te 
in 
le 
it 
Q 
al 
e 
e 
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Our numerical calculations indicate that the second terms in each of the 
latter equations tend to zero as x-—>0. We assume, then, as boundary 


conditions 
(4. 35) 0) —0, (4. 36) =0; 


this assumption seems justified by the results. In equations (4.31) to (4.34) 
the functions P(8) and Q(f) refer to the solutions of the limit boundary 
value problem treated earlier in this section. 

Before solving the differential equations we note the relation 


(4. 37) +8Q—=6/5 for B=0, 


which will serve as a useful check on the solution. To prove (4.37) we start 
with the identity 


(4.38)  [Pp8Ps — — PQ8Q — Q°8P/2]p — 3 (Ps? — 
which follows from (4.31), (4.32), (4.08), (4.09). Integration yields 


(4.39) — PQ8Q — Q°8P/2|°——3 — Qs") dB, 


where the boundary conditions for 8B = « have been used. The right member 
can be evaluated: we multiply (4.08) by P, (4.09) by — Q, add, and inte- 
grate from 0 to o. The result, after integration by parts, and use of the 
boundary conditions for 8 = o, can be written 


(Pe — + 3Q°P/2) dB =— PPg + QQz|° = Q(0) = Qo, 


where (4.10), (4.11), and (4.15) have been used. In view of (4.14) this 


vields 


co 
(4. 40) (5/2) (Ps? — dB Qo 
Insertion of this in (4.39), use of (4.10), (4.11), (4.15), and division by 
— Qo gives 
(4, 41) + 80 + $Q08P|\° = 6/5 


and (4.37) follows from (4. 33). 

It is possible to obtain solutions of the homogeneous differential equa- 
tions (4. 31)> and (4.32), derived from (4.31) and (4.32). One considers 
a one-parameter set of solutions P(B,«), Q(8,«) of the asymptotic differen- 
tial equations (4.08) and (4.09); the derivatives of P and Q with respect 
to « satisfy the homogeneous equations (4.31) ) and (4.32) , as one easily 


verifies, 


| 

| 

| 
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One sees readily that P(B + «), Q(B +) is such a set; the derivatives 
of these with respect to a for «= 0, i.e., 


(4. 42) dP = Pp, 50 = Oz 

constitute, therefore, solutions of (4.31) and (4.32). We note the following 
values *° 

(4.43) 8P=—Q, 8Q0=0, for B=—0, 
which follow from (4.10) and (4.15). 

A second one-parameter set is formed by «?P(aB), a?Q(aB), as can be 
readily verified ; the derivatives of these with respect to « for «—1, 1.e., 
(4. 44) 8P—BPg+2P, 80—BQs+20 
form a second solution of (4.31) ) and (4.32). We note the following 
values *° for this solution: 

(4. 45) P=2, Q=20., P=—3Q0., for B=—0, 
which follow from (4.10) and (4.15). 

We proceed now to construct a solution of the non-homogeneous equa- 
tions (4.31) and (4.32) satisfying the conditions (4.35) and (4.36). 
Since these conditions are satisfied by the above solutions of the homogeneous 
equations (4.31), and (4.32) ) we may satisfy the remaining boundary con- 
ditions by adding an appropriate linear combination of the latter solutions. 

We introduce the new variable x = ge“ as in (4.16) and the new 
functions dy and 8z by 
(4. 46) = 89 =—3V 2wdz. 

With the notation of (4.17) and (4.18) equations (4.31) and (4.32) become 
(4. 47) + 2282 = 
(4, 48) 2 — — By = 


Assuming for dy and 8z the power series 


(4. 49) by = (— 1) ya, 
(4. 50) = (— 1) M32 


m=0 
and the series (4.21) and (4.22) for y and z we obtain by insertion in (4. 47) 


and (4.48) the relation 


20 We observe that the left member of (4.41) vanishes, as it should, for these values. 


12 


e 
| 
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(4. 51) (2h) 2 kyx, (k i, 2, *). 
n+m=k-1 

(4. 52) (2m +1)*82m = + 2ndyx) + (2m + 1) 2m, 
k+n=m 


(m =0,1,° 


The coefficient of 5¢m in the right member of (4.52) is yo ==1; hence we 


may replace (4.52) by 
m-1 


(4.53) 2m(2m + 2)82m = Ymnden + + (2m + 1) 2m, 


n=0 k+m=n 
(m =0,1,2,: °°). 


This and (4.51) constitute proper recursion formulas for the series (4. 49) 
and (4.50). The coefficient dy is fixed by (4.53) for m 0; its value is —1. 
The coefficient can be chosen arbitrarily; it is convenient to take = 0. 
The values y;, and 2m have been previously calculated from (4.23) and (4. 24) ; 
the coefficients dy, and 82m can then be determined from (4.51) and (4. 53). 
The convergence of the resulting series for dy and 8z appears to be very 
satisfactory. 

In this way we find a particular solution 8P, 8Q of the non-homogeneous 
equations (4.31), (4.32); by adding a proper linear combination of the two 
previously found solutions of (4.31), and (4.32) we obtain a solution of 
(4. 31) and (4.32) satisfying the conditions (4.33), (4.34). We give a few 
numerical results: ** 

sP=0, 80 =—.74, 5Pg—1.94, 80g —2.13 for 

sP =— 1.03, 80—0, 8Pgs—0, 80,—0 for 
Hence we have the following approximation formulas for large » (that is, 
for large N): for B=0 

P=1, Q=—1.61—.74/A, Pg =—1.61+1.94/A, Q =—2.13/; 
for B = « we have 

P=— A?—1.03/A, Q=0, Pp=0, Op=—0. 


It is clear that the preceding calculations yield also approximate formulas for 
all physical quantities such as stresses and deflections. 


PART II. 


In Part II of this paper we consider the purely mathematical aspects of 
the boundary value problems which were discussed in Part I from the point 
of view of explicit numerical solution. 


21 Note that our results check with formula (4. 37). 
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5. Minimum and boundary value problems for the finite case. Our 
subsequent discussion will be based upon the possibility of formulating mini- 
mum problems equivalent to our boundary value problems.” In this section 
we consider the problems for finite A. The functional to be minimized, essen- 
tially the total potential energy,”* is 


R 
R R 
—pR* + pr? (r) redr. 


We do not vary p and q independently. Rather, g must be varied while p is 
considered a functional in g through the differential equation (1.08) and the 
boundary conditions (1.10) and (1.12). 

It is convenient to work with the invariant quantities p*, g*, and 1. 
defined at the end of sec. 1, but we omit the * from now on. We introduce 


the functionals 


(5. 02) qHq = q?(r)redr, 
0 
a1 
(5. 03) qDq—= + 
1 
(5. 04) p2(r)edr, 
0 
and 
(5. 05) Vilq] = —A°qHq + prK pr. 


The new functional V, to be minimized is related to (5.01) through 
V = 
We define p(r) and p,(r) as functionals in g(r) by means of the formulas 


(5.06 


(5. 07) r(r) drs. 


We are now in a position to formulate admissibility conditions. By 
admissible functions we mean functions q(r) continuous in 0 << r=1 which 
possess L*-integrable ** derivatives in every interval 0 << and for 


22 Incidental use of this fact was made in see. 2. 
*8 The total potential energy is given by 


mhE[V — (1— v) R*p?]. 
*4 For a method avoiding Lebesgue derivatives which could be applied here, see [8]. 


)) 
he 
0. 
y 
ls 
‘0 
yf 
W 
| 


860 K. 0. FRIEDRICHS AND J. J. STOKER. 


which the integrals H, D, and K are finite, p, being considered a functional 
in g(r) through (5. 06). 

The minimum problem M) is that of minimizing V,[q] with respect to 
admissible functions q(r). 

The first variation 6V) of V) is 


(5. 08) 8Vi[q] = qD8q — A°qHdq + 


where the bilinear forms corresponding to (5.02), (5.03), and (5.04) have 
been used ; 8g is any admissible function and 


(5. 09) gdqr,tdr,. 


V, is said to be stationary for a function g if 8V,[q] = 0, for all admissible 
dq. We refer to the problem of making V) stationary in this sense as the 
problem S,. A solution of M) is also a solution of 8); this conclusion results 


from the standard reasoning. 

We formulate the boundary value problem B) as follows; an admissible 
function g is to be found which possesses a continuous second derivative and 
satisfies the differential equation 
(5. 10) Gq + pq = 0, 
and the boundary condition 
(5. 11) Byg=rqr + (1+yv)q=0 for r—1. 


The function p in (5.10) is defined by (5.07) and (5.06). These conditions 
) 


imply that p satisfies the differential equation 


(5. 12) Gp = q’/2, 
and the boundary condition 
(5. 13) p=? for r<—1. 


The conditions = 0 at r=0 which were used in calculating the solu- 
tions in sec. 1 are not required ; we have sufficiently characterized the behavior 
of p and g at r=0 through the admissibility conditions * that the integrals 
H, D, and K be finite. 

The problems 8, and B, are related as follows: 1) A solution of By 
solves S,. 2) If an admissible function q solves S) it possesses a continuous 


second derivative and solves 6). These statements are proved in sec. 7 (cf. 


25 Tt could be shown that a solution of the boundary value problem which satisfies 
the admissibility conditions would also satisfy the regularity conditions at r= 0. 
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Theorems 7.1 and 7.2) in terms of convenient new variables, which at the 
same time permit proofs of the corresponding statements for the asymptotic 
case. We may mention here that the connection between the two problems is 


based on the Green’s formula 
1 
(5. 14) (Gq + py) + Bvgdq|*, 
which, in terms of new variabes, is justified in sec. 7 (cf. formula 7. 21). 


6. Multiplicity of solutions of the boundary value problem. In this 
section we are interested in the number of possible distinct solutions q of the 
boundary value problem .B, in their dependence upon the parameter A. We 
proceed first to give a general description of what can be expected to occur 
with increasing A. For sufficiently small A there will be only one solution, i. e., 
qg=0. This holds up to AA, where A» is the lowest eigenvalue of the 
linearized buckling problem. Beyond A, two new solutions will appear which 
differ only in sign. We shall prove in this section that these solutions together 
with g=0 are the only solutions as long as A < Ax, where A; is:the second 
eigenvalue of the linear buckling problem. It could be proved, for example 
by the perturbation method, that:two additional solutions will appear when A 
passes the next critical value A,. It seems likely that there would be exactly 
2n +1 solutions when An. <A< An, one the solution g=0, and n other 
pairs differing only in sign. In this section we prove less, namely: 


THEOREM 6.1. For0 SAA, the sole solution of By ts qg =0. 


THEOREM 6. 2.°° For Ay <A <A, there are at most three solutions of 


By; one of these is g==0 and the other two differ only in sign. 


We shall make use of the following properties?" of the eigenvalues A,” 
and A,”._ The inequality 


(6. 1) qD\q = qDq —A*qHq > 0 


holds for admissible functions g #0 (cf. sec. 5) if A < Ao, equality holding 
for the eigenfunction ¢=4qo; the inequality (6.1) holds also for A»<A< Ax 


if the additional restriction 

26'This theorem could be easily generalized. The essential properties used are 
readily seen to be: D+ UH is a positive definite quadratic form, H is a completely 
continuous quadratic form with respect to D+ H as unit form, and K is a positive 
definite, centro-symmetric, and convex functional of degree greater than one. The reason- 
ing given here could, with slight adaptations, be taken over for such a generalization. 


*7 Cf. Courant-Hilbert [3] Volume I, Chapter VI. 
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(6. 2) 


is imposed, where qo is the eigenfunction belonging to Ap. 

In sec. 5.it was stated that solutions g of the boundary value problem B) 
also solve S), i.e., they are completely characterized by the statement that 
8V,[q] = 0 for all admissible 

If g renders V,[q] stationary, the identity 
(6. 3) qgDyq + 2prK pr = 0 
holds. It results when 8q is replaced by qg in (5.08) and (5.09). 

For AA, we may apply (6.1), and (6.3) leads to p-Kp,—=0. In view 
of (5.04) and (5.06) this is possible only for g==0. Thus Theorem 6. 1 
is proved. 

In the proof of Theorem 6.2 we make.use of the fact that the positive 
definite form p,Kp, is a convex functional (of fourth degree) in g. By con- 
vexity we mean that the second variation 8p,K8ép, + 8°p,Kp, of p,rKpr is 
positive. This is the case since the second term is positive because p, > 0 and 


Sp, = r® 4; (8q¢)?r,°dr, > 0, while the first term is obviously positive. 
0 


The proof of Theorem 6.2 will be given indirectly. Assume, then, that 
there are two linearly independent solutions g‘ 0, gq) 40. We introduce 
the linear combinations g = aq‘) + aq") and consider the homogeneous 


forms 
Q (4%, %) = qDyq, P (a, %) = prK py. 


Since and make V) stationary, the sum. V), = Q(a, + P(@, %) 
is stationary for (1,0), (—1,0) and for (0,1), (0,1). Our Theorem 6. 2 
is proved if we can show that all stationary points of V; = P + Q lie on the 
same straight-line through the origin in the (¢,, «)-plane, in contradiction 
with the assumption that (1,0) and (0,1) are stationary points. 

The quadratic form is indefinite since 1. 9(1,0)=—q™ Dyqg™ <0 
as we see from (6.3); 2. the linear set q contains a function ¢ = 4,q") 
+ Gq) 40 satisfying gHq.—0 and since A<A, we know that (6.1) 
holds for 9; thus Q(&, = qGD)g > 0. The form P(@, a2) is of the fourth 
degree, and is positive definite and convex since, as we have seen, this is the 
case for p-Kp,. Both forms are clearly centro-symmetric and stationary points 
occur in symmetrically located pairs. 

Let %,,% be any values of a, and @ (a,?-++ %*s40) for which 
Vy. = Q( a1, %) +P (a, %2) is stationary, i. e., for which 8V, 8Q + 6P = 0. 
As a consequence we have 


(6. 4) — Qa,da, — Qa,d%, = Pada, + Pada. 


j 
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for a, =, %—G. Let P and @ be the values of P and Q for a, = %, 
== In view of (6.4) the curves =P and Q(1, %) = @ have 
a common tangent at (%, 

Consider a second point where Vj =P + Q is stationary and where, 
consequently, two curves P = const., Q@ = const. have a common tangent. By 
a similarity transformation in the («,, #)-plane we can transform P = const. 
into the original curve P = P while the equation for the new curve obtained 
from Q = const. by the transformation may be written in the form Q = «Q 
with « > 0. In order to show that such a second stationary point is on the 
same ray as (&,,%) it is obviously sufficient to show that it concides with 
(%,, 4%.) after the similarity transformation. This we accomplish by proving 
that the set of curves Q = «Q contains only one curve, namely Q = Q, tangent 
to P=P and that Q=@ has only two symmetrically located points of 
contact with P = P. 

The set of points P S P is convex since its boundary is a level line of a 
convex surface. The curves Q —«Q constitute a set of hyperbolas since 
Q (a, %) is an indefinite quadratic form ; we need evidently consider only one 
branch of these hyperbolas. Each such branch is the boundary of an unbounded 
convex set S,. In addition it is to be noted that the origin is not contained 
in S,, but is an inner point of the set P= P. 

When the curves P= P and Q ~«Q have a common tangent, the two 
convex point sets P< P and §, lie entirely on the same side of the tangent 
line, or they lie entirely on opposite sides. The first case is excluded since 
the tangent to the hyperbola Q = «Q separates the set S, from the origin and 
this point lies in the interior of the set P< P. In the second case the two 
convex sets have only one point in common since S, is bounded by a hyperbola 
and P = F is convex. 

We have assumed that a common tangent line exists for «1; such a 
tangent line separates the sets P< P and S,. For every « >1 it is obvious 
that the sets PP and S, have no common points. For every « < 1 it is 
obvious that the two sets have inner points in common; hence a possible new 
tangency would fall under the first case which has already been excluded. With 
this we have established the contradiction which proves our theorem. 

It appears that the type of reasoning used above would not be sufficient 


to give a corresponding theorem for A > dj. 


7. Simultaneous formulation of finite and asymptotic problems. For 
the investigations of sections 8 and 9 it is convenient and useful to formulate 
our problems in terms of certain new variables. At the same time the new 
formulations make possible simultaneous treatment of the finite and the 
asymptotic problems. 
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The new variables are (cf. sec. 5) 
(7.01) P= p/p and (7. 02) 
the new independent variable is 
(7. 03) 8B =A(r?—1)/2 with the domain 0=B < o. 
With the notation 
(7. 04) x == we have (7. 05) r= (1 + 2B)’. 
The parameter «x occurs essentially only in the combination 
(7. 06) pX(B) = (1 + 


In the following formulations we admit x = 0; the cases x > 0 are equiva- 
lent to the cases for finite A (cf. sec. 5), while the case x = 0 corresponds to 
the transition A—> oo and will be seen to represent the asymptotic case. 


We introduce the forms 


(7. 07) H{Q]= 


Q2(B)p*(B) 4B, 


(7. 08) = Qs?(8)d8 +«(1 +») Q2(0), 
and 

(7.09) K*{Q] dB, 

where Pg is a functional in Q through 

(7. 10) Ps = =— (1/2) Q?(B:)p*(B:) dBi. 
The functional *® to be minimized is ; 

(7.11) =D* + K*[Q]. 


By admissible functions we mean functions Q(8) continuous in 
< with L?-integrable derivatives in every interval BS=b)< 
and for which the integrals in (7.07), (7.08), and (7.09) are finite, Pg 
being defined by (7.10). 

The minimum problem M* is that of minimizing W*[Q], the problem 
S* is that of making W*[Q] stationary, in each case with respect to admissible 


functions Q. 


28 The variable 8 here is not the same as that of sec. 4 but it yields the same 
transition to the asymptotic case. The transformation r= (1—«f) of sec. 4 could be 
obtained as the term of first order in the development of (7.05) with respect to kx. 


2° The relation between Vy (see (5.5)) and Wk is Wek = KV). 
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The boundary value problem B* requires the determination of an admissi- 
ble function Q(8) possessing a continuous second derivative and satisfying 


the differential equation 


(7. 12) Ose + = 0 
and the boundary condition 
(7.13) Qs(0) —«(1+v)Q(0) =0. 
The function P(f) in (7.12). is defined by 
oo 
(7.14) P(B)=1+ 
0 
where Ps is given by (7.10). The function P therefore satisfies the differential 
equation 
(7. 15) Pep p*Q?/2 
and the boundary condition 
(7. 16) P(0) = 1. 


The problems J/*, S*, and B* (x >0) are the equivalents, in the new 
variables, of the problems M), 8), and By of sec. 5. 

To obtain the formulation of the asymptotic problems M°, 8°, and B® we 
need only set = 0 in the preceding. Since x = 0 implies p? = 1 (cf. (7. 06)), 
this formulation yields the same differential equations and boundary condi- 
tions as in sec. 4 except that the conditions used in sec. 4 at 8B = o are here 
replaced by the admissibility conditions. 

The connection between the problems S* and B* (including x —0) is 
based on two “ Green’s” formulas. They refer to the first variation 


(7.17) BIVLQ] = — + dB 
+ «(1+ ¥)Q(0)3Q(0), 


where SP. is defined, in accordance with (7.10), by 
B 


It is convenient to state first the following lemma, the proof of which 


will be given later: 


LemMa 7.1. Jf f(8) and g(8) are continuous functions with L?- 


< < ow, then there is 


0 


integrable derivatives for which f 


0 
a special sequence B—> «& on which 


(7.19) f(B)ga(B) > 9. 
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CorotLary. If 1,9, and fo, g2 are pairs of functions with the properties 
of f and g in Lemma 7.1 there exists a special sequence B—> «© on which 


(7.19) holds for both pairs. 
We consider (for finite b > 0) the relation 


b 
—— { (P—1)8P aad + [(P—1)8P 9]? 


obtained by product integration. Lemma 7.1 with f—(P—1) and 
9g = shows that if on a special sequence. 


Hence we have 
co co 
f, =— (P—1)8Pppdp + [(P—1)8P plo. 


If this and the relations P(0) —1, Pag = Q8Q (cf. 7.18) are used one 
obtains from (7.17) the first Green’s formula: 


(7.20) 8W*[Q] = — + «(1 + v)Q(0)8Q(0). 


co b 
It holds for all admissible Q and 8Q and f means lim f when b)— © on 
0 


0 


a special sequence. 
If Q possesses a second derivative we may write 


b 


The corollary to Lemma 7.1 applied to [(P—1)8Pg]’ and [Qs5Q]? yields 
the existence of a common special sequence b — o on which both expressions 


ie, ©) 
tend to zero. If this sequence is used and if f QgsQedB8 is replaced by 
0 


— + [Qs5Q]. in (7. 20) we obtain the second Green’s formula 
(7.21) f [Qe + 
— + «(1 + v)Q(0) ]8Q(0), 


which holds for admissible functions Q possessing second derivatives, the 


integral again being “conditional.” Formula (7.21) yields immediately 


xt bP ,*dB < © follows from the admissibility conditions on Q and 5Q and the 


0 
Schwarz inequality applied to (7.18). 
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THEOREM 7.1.51 ‘A solution of B* solves S*. 

The converse also holds: 

THEOREM 7. 2.°1 A solution of S* (hence also a solution of M*) possesses 
a continuous second derivative and solves B*. 

To prove Theorem 7.2 we make use of the following well known 

Lemma 7. 2.°? Let R(B) be an L?-integrable function and be a 
continuous function such that f, = ff, holds for all continuous 


functions T(8) with L*-integrable derivatives Tg which vanish identically in 
the neighborhood of B=0 and of B= «. Then R(B) coincides (almost 
everywhere) with a function R which possesses the continuous derivative — 8S. 
We note in addition that R=R in case R is the derivative of a continuous 
function. 

We apply this lemma to R = Qg, S = p*PQ, where Q is a solution of S*. 
Since 8W*|Q] —0 for the admissible variations 8? = T it follows from (7. 20) 
that Q possesses the continuous second derivative —p*PQ; thus Q satisfies 
the differential equation (7.12). We may now apply (7.21); it yields 
0 = [Qe(0) + «(1 + v)Q(0) ]8Q(0) and since 8Q(0) is arbitrary the boun- 
dary condition (7.13) is satisfied. Hence Theorem 7.2 is proved. 

We have to supply the proof of Lemma 7.1 and its corollary. This will 
be done with the aid of two additional lemmas of which we shall make 


frequent use in later sections. 


LemMMA 7.3. “Jacobi’s identity.” If f(B) is a function with L?- 
integrable derivative and the positive function w(B) possesses continuous 


second derivatives, then 


b 
(7. 22) + otopef? = f w*[ (of) + wo 


The identity (7. 22) follows from 
f f (fp? — + dB 
= (fp + *wppf?) dB — 


81 These theorems are equivalent (for x > 0) to the statements made at the end of 
5. 
52 The lemma can be proved in much the same way as the lemma of du Bois-Reymond 


to which it reduces if R(8) is assumed continuous. 


sec, 
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Lemma 7.4. If f(B) is a continuous function with L*-integrable derwa- 


tive for which f. f?(B)dB < «, then 
0 
(7. 23) B*f(B) as Bo 


To prove the lemma we start with Jacobi’s identity for o = ~’/?: 


or 


b 


For }— o the right member is bounded; hence the terms of the left member 
are bounded since all are positive. From 


we see that B*f?(8) — 0 at least on a special sequence B—> 0. If b is taken 
on that special sequence we obtain 


which, for a— o, yields a“*f?(a) + 0 and the Lemma 7. 4 is proved. 
We turn now to the proof of Lemma 7.1. From Bgp°B dB < « 
0 
we infer that a special sequence exists on which B'/*gg—>0; in view of 
Lemma 7.4 we have 8’/*f(8) +0 and (7.19) follows immediately. 


For the proof of the corollary it is sufficient to observe that from 


(B[9:8]* + Bl < a special sequence B— o exists on 
0 

which B[ gig]? + Bl and that therefore also B'/*g,g—0 and 

> 0. 


8. Uniqueness theorems. In sec. 6 we proved that the boundary value 
problem B, has, apart from sign, only one solution g #0 for a certain range 
of values of A (A < A). It was also indicated there that the problem B) will 
possess more and more distinct solutions as A increases beyond A,. The situa- 
tion is quite different as regards the minimum problem M). We shall prove 


in this section (with the notation of sec. 7): 


THEOREM 8.1. There is at most one solution Q of the minimum problem 
M*, apart from the sign of Q. 


| 
3 
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We first dispose of the case in which Q =0 is a solution of M*. In this 
case the functional W* is non-negative; otherwise there would be a function 
Q with W*[Q] < 0 and W*[Q] =0 would not be the minimum. If Q is any 
solution of M* with 


= D*[Q] — + K*[Q] =0, 
then, for constant « 
W*[eQ] = &{D[Q] — H[Q]} + &K[Q] = K[Q] 


would be negative for «<1 unless K[Q] —0, which implies Pg = 0, and 
Q? = 2Psg = 0. Hence Q=0 is the only solution of M* if W* is non-negative. 
From now on we may thus leave aside the case in which Q =0 solves M*. 
We have noted in the preceding section that a solution of the minimum 
problem M* also solves S* and the boundary value problem B* (cf. Theorem 
7.2). Hence Theorem 8.1 (for Q#0) results from the following two 


theorems: 


THEOREM 8.2. A solution Q(B) #0 of M* is nowhere zero in the 
interval O= B< 


THEOREM 8.3. A solution Q(B) of B* which is nowhere zero in the 
interval 0 = B < oo is, apart from the sign of Q, the sole solution of M*. 


An immediate consequence of Theorem 8.3 is the following 


Corotiary.*® The problem B* has only one solution, apart from the 
sign of Q, which is nowhere zero in the interval 0 S oO. 
g ‘ 


We prove Theorem 8.2 indirectly. Assuming then that the solution 
Q(B) of M* vanishes for some ‘value B=. We distinguish two cases: 


That Case 1 or Case 2 occurs follows from the fact that P(8) decreases 
monotonically from the value P(0) = 1. 

Case 1: We make use of the fact that Q(8) solves B* and hence satisfies 
the differential equation Qgg + p*PQ =0. If a solution Q(B) of such a 
differential equation vanishes at a point B =o, then, as is well known, 

83 This corollary shows that the solutiens for the finite and the asymptotic cases, 
as treated in sec. 3 and sec. 4, are the sole solutions of B« provided that the power 
series used there converge and the transcendental equations are solved. The admissi- 
bility conditions are satisfied if the series converge and the non-vanishing of the solu- 
tions is guaranteed by taking the lowest root of the transcendental equations, as one 


easily shows. 
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Qe(Bo) #9 unless Q(8) =0. Hence, in our case, Yg(Bo) #0. We may 
assume Qs(Bo) > 0. Let b be the greatest value such that Q(8) =0 for 
Bo =B< bd. In this range we then have p*PQ=0, Qgg =O and, conse- 


quently, Qs(8) = Qa(Bo) and Q(B) = (8—Bo)Qa(Bo). If 6 were finite 
the latter inequality would yield Q(b) >0, in contradiction with the 


definition of b. Hence b=. The inequality = Qg(Bo.) > 9 for 

Bo = B < @ is, however, in contradiction with < This rules 
0 


out Case 1. 


Case 2: We shall dispose of this case by constructing an admissible 
function Q¢ for which W[Q*] < W[Q] in contradiction with the minimum 
property of Q. We first replace Q by | Q| and observe that W[| Q |] = W[Q]. 
We choose 8, = Bo and B2 > Bo such that 


P>0 for BS Be 


We then introduce a non-negative function »(8) 50 with continuous deriva- 
tive which has the following properties: 


(8) =0, =0 for and 
for BSP, SO for B=> Po. 


Upon introduction of the function Q* =| Q | + ey, we notice that 


f, “[Q*(B:) — dB: = 
B B 


where 


Hence 


K[Q]—K(Q] —— + (e/4) 


— (0) Q| Pag 


For the latter equation the definition of 3«(8) was used. Since »=0 for 
ee) Be 
Brand P > 0 for Bz, we have n|Q|Pap— 1|Q| PdB > 0. 
0 


q 
f 
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It is clear, therefore, that « can be made so small that the quantity in the 
brackets is negative. Hence 


(8. 02) K[Q‘] —K[Q] < (0). 
Further we have, from the definition of (8), 


f 
Bo Bo 
Bs Bo 
=<[—2 J, ns | | Qe | ap+e np ap}. 


Evidently we can choose e so small that also here the quantity in the brackets 


is negative ; i. e., 


co co 
(8. 03) J, <0. 
0 0 
By combining (8.01), (8.02), and (8.03) we obtain 
WLO*] — W[Q] < 0 


in contradiction with the minimum property of Q. This contradiction estab- 
lishes Theorem 8. 2. 

Theorem 8.3 is equivalent to the statement: if Q° is any solution of B* 
which does not vanish for 0 = B < o, then 


W*[Q] = W*[Q*] 


for every admissible function Q and the equality holds only for QQ = + Q°. 
Let P and P® be the functions corresponding to Q and @Q°. We derive 
the identity 


oo co 
(8. 04) f (1— = 2 f 
0 
co 
The integral on the right-hand side exists in view of f. [P,°]*dB < o and 
0 


foe) 
f Ps’°dB < «. To prove (8.04) we introduce the identity 


From Lemma 7.1 applied to f = 1— P°, g = P it follows that the last term 
tends to zero when a—> © on a special sequence and (8.04) is established. 


co 
Formula (8.04) implies that f (1— P°) Q*p*dB exists for every admissible 


lay 
for 
Se- 
he 
‘or 
les 
le 
m 
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Q; in view of (1—P*) =0, | P?|<1+(1—P*) and < 
7/70 


we can infer 
co 
(8. 05) *4 | P?| < o. 
We now introduce the quadratic functional 
oo oo 
which, in view of (8.05), is defined for admissible @. From (8.04) we have 
—T(Q]+ Perap—z 


a 


dB, 
and for 9 = Q° 
Subtraction yields the identity 
(8.08) W[Q] = — 710°) + 
Theorem 8. 3 is a consequence of (8.08) and 
LemMA 8.1. For admissible Q 
(8. 09) T[Q] =0, 
where the equality holds only for Q = cQ°, c = const. 


Lemma 8. 1 implies 
(8. 09) T[Q°] =0. 


If Lemma 8.1 holds, (8.08) yields W[Q] = W[Q°], the equality holding 


oa 
only for 9 = cQ°, Pg = P°s or f [Q? — (Q°)?]p*dB = 0. Since p* > 0, the 


if 
last equality holds only when Q = + Q°. Hence Theorem 8. 3 is proved once 
the inequality (8.09) is established. This inequality states that Q° minimizes 
the quadratic functional T[Q]. 
We proceed to establish Lemma 8.1. Our proof follows the line of 


** From (8.05) and the admissibility conditions for x = 0, p° = 1, it follows that 
oo 
[Q,° — P,° + PQ?]dBf is finite. For the solution Q, P the integrand is constant 


according to (4.13). This constant is therefore zero, confirming the assumption made 
in (4.14). 


| 872 

| 
| 

| 
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ve 
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Jacobi’s treatment of the second variation ; however, the infinite range of the 
independent variable and the singularity occurring in our problem require 
considerable modifications of the standard reasoning.*® In view of Q° >0 
it is possible to introduce the function 


(8. 10) 6=Q/Q°. 
With this function 6 the identity 
(8.11) 


holds, as we shall prove. The identity (8.11) implies that T[Q] = 0 and that 
the equality holds only if 06g (since Q°s40), that is, if const., 
or 9 =cQ°. Thus Lemma 8.1 follows from (8.11). 

To prove (8.11) we apply Jacobi’s identity (7. 22) to o—@° and obtain 


a 
0 0 


or, in view of Q°sg = — p*P°Q® and Q°s — «(1 + v)Q°|° =0, 


— P°Q*p* + «(1 + v)Q?(0) = + 


If it could be shown that the last term tends to zero as a—> © on at least a 
special sequence, (8.11) would result in view of (8.06). It is doubtful 
whether such a direct attack is possible. 

Instead, we proceed as follows: Since B*[Q°(B)]*?—>0, according to 


Lemma 7.4, there is a special sequence B— « on which (8-*Q°)g = 0, or 
= $8°Q, or = But 0. If a is taken 


on this sequence, the inequality 
oO 
(8. 11)- TIQ)< (Q°)2652d8 results. 
7/70 
Identity (8.11) will be established once the reverse inequality 
= 
(8.11) T[Q|= (Q°)26g2dB is proved. 
70 


To this end we introduce the function Qf = Q° + «8 and set 
(8. 10) for «> 0. 


We then apply once more Jacobi’s identity (7.22) with o = Q€ and obtain 


8>OQur reasoning could be made the basis of a general theory extending Jacobi’s 
treatment of the second variation to the case of singular or infinite end points. 


13 


| = 
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Jf, (Qe? + — (Q9? + 
using the relation 


— = p*P°Q°, = 1— and Q%s— (1+ 


we have 
+ (1 + 
Since there is a special sequence B—> on which 
(B) — 0. On this sequence we have 


in view of Lemma 7.4; hence (Q*)-'Q&sQ?—> 0. If we let a—o on this 
special sequence we obtain the identity 


Allowing « to tend to zero in (8.11), we have 


and 


The last quantity can be made arbitrarily small by choice of a. Further we have 


b oo 


Hence (8.11), yields in the limit 


and (8.11)* is established since b is arbitrary. Hence (8.11) holds, Lemma 
8. 1 is proved and, with it, Theorem 8. 3. 


9. Existence and asymptotic convergence. In this section we prove 
the existence of the solutions of the minimum problem M*, including the 
asymptotic case M°, and establish the convergence of the solutions for x > 0 
to the asymptotic solution (x0) as x-—>0. We apply direct methods similar 


| 

| 

\ 

| 
| 
| | 
i 
{ 


ve 
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to those used for linear boundary value problems (cf. Courant-Hilbert [3], 
Volume II, Chapter VII). 

We use the same formulation of the minimum problem M* as in sec. 7. 
Functions Q admissible with respect to the problem M* in the sense of sec. 7 
are here referred to as x-admissible functions. 

Our theorems are 

THEOREM 9.1. To every x=0 there exists at least one x-admissible 
function Q(8) for which W*[Q] attains tts minimum. 

Such a minimizing function will be denoted henceforth by Q*(B) ; it is 
uniquely determined (Theorem 8.1) once the condition Q*(0) = 0 has been 
imposed. 

THEOREM 9.2. The minimizing functions Q*(B) with Q*(0) = 0 tend 
as x—>0 to the minimizing function Q°(B) with Q°(0) =0 uniformly in 
each finite interval OS BS) < ~, and W*[Q] tends to W°[Q°]. 

The proofs of the theorems are based upon a number of preliminary 


lemmas and inequalities. 
We prove first 


2 
(9. 01) { < 
Jd 
for any x-admissible Q and b >0. Due to Pgg=0 (cf. (%.10)) the non- 
negative quantity — Pg does not increase with increasing 8. Hence 


establishing (9.01). 
Next we show that a constant c > 0 exists such that 
(9. 02) S + 2c{K*[Q]}* 


for any x-admissible ** Q and all x= 0. 
To prove (9.02) we distinguish two cases: kx [1,«=—1. 


Case 1 (kx =1). We consider the successive inequalities 
2 co 
Q?(B:) S 2Q7(B: + 4) +2 | — Q(B: + 4)/? 
f 


°° The argument used to prove (9.02) could be so modified as to show that the 
admissibility condition on H« is a consequence of the others and could therefore be 


omitted. 


€, 
36)? 
ch 
Lis 

| 
a | 
e 
e = 
T 
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Integration with respect to f, yields 
% 1 00 
0 0 


Since p* <1 and p* = (1+ 2«8)* = for we have 
% 1 
f < 2- 3° +4 f 
0 0 
By adding ff. p*Q°dB to both sides we obtain 


72 


We now use (9.01) with b = 3, that is 


and thus obtain 


where c= V2(1+2-3°). This establishes (9.02) for «<1. 
Case 2 (x=1). We set 
o = (1+ 8)*(1 + 28) 
in Jacobi’s identity (7.22) noting that 
op = (1+ (1+ 8)*(1+ 28)", opp =—2(1+ 8)”, 
and 
w ugg = 2(1 B)-2(1 + = 2(1 + = 2*(B). 

The result is 

co oO 

— dB = J + 

Since 

wo twp|° ], Bw-twg| © =(, and (), 


according to Lemma 7.4, the right member of the inequality is non- 


co 
f <4 f 
70 


This establishes (9.02) for x = 1 and hence generally. 
From (9.02) we deduce the inequality 


(9. 03) W*[Q] = D*[Q] + 4 —e’, 
which implies 


negative, i. e., 


THE NON-LINEAR BOUNDARY VALUE PROBLEM OF THE BUCKLED PLATE. 877 


LemMaA 9.1. For x-admissible functions Q, W*[Q] has a lower bound 
(—c*) independent of x. 
From (9.03) we obtain 
= +e, 
(9. 04) K*(Q] S {e+ (W*[Q] + c?)*}*, and from (9. 02) 
H*[Q] S 3c? + W*[Q] + 2c(W*[Q] + c?)%. 
Consider a set of x-admissible functions Q (with « not necessarily fixed) for 
which W* has an upper bound M. We conclude from (9. 04) 
LemMaA 9.2. An upper bound M for W* implies upper bounds for D*, 
H*, and K* which depend upon M but not on x. 
In what follows we shall make frequent use of the following well-known 
lemmas: 
Lemma A. If fs(B) is a sequence of non-negative continuous functions, 
defined for 0 = B < which converge to a limit function fo(B) uniformly 
in every finite interval, then fo(B) is continuous, non-negative, and 


ff, fo(8)dB tim 48. 


Lemma B. If, in addition, to every «>0 a quantity b =b(e) exists 
such that 


f, (B)dB—> fo(8) a8. 


We proceed to state the following basic 


LeMMA 9.3. Let xm be a sequence of values of x tending to a finite 
value xs, and let Um be a sequence of km-admissible functions for which W*"[ Qm] 
is bounded. Then there exists a subsequence Q, converging uniformly in every 


finite interval to a x+-admissible function Q such that 


(9. 05) lim = 
(9. 06) lim H"[Q.] =H™[Q-], 
(9. 07) lim K"[Q,] = K™[Qe], 
(9. 08) lim W" [Qs] = W[Q.]. 


Proof. Lemma 9.2 shows that D*=[Qm], H*™[Qm], and K*"[Qm] are 
bounded. A bound for D*=[Qm] (cf. (7%.08)) implies a bound for the 


sequence J Q?mp(8)dB. Hence, as is well known, there exists a subsequence 
0 


00 
f fs(B)dB Se for all s, then 
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converging uniformly in every finite interval 0 = 8 = to a continuous limit 
function Q+ with L*-integrable derivative and such that 


This inequality yields (9.05), in view of (7.08) and Q.(0) >@Q-(0). In 
order to establish (9.06) we use (9.01) in the form 


Since K** is bounded, the right member can be made arbitrarily small by 
proper choice of b. Hence we may apply Lemma B with fs = p**Q,? and thus 
obtain (9.06). By virtue of the definition (7.10) of Pg we have 

| Pe [Qs] — Ps*[Q-] | = 2 | H"[Q.] —H*[Q-] | 
and (9.06) yields the uniform convergence of Pg*[Qs| to Ps"*[Q+]. Hence 
we may apply Lemma A to K**; the result is (9.07). 

As a consequence of (9.05), (9.06), and (9.07) Qs is x+-admissible 
and, in addition, (9.08) holds. 

We are now in a position to prove Theorem 9.1. We turn, therefore, to 
the problem of minimizing W*[Q] by a«-admissible function Q. From Lemma 
9.1 we know that the g.1. b. w* of W*[Q] is finite; hence there exists a mini- 
mizing sequence, i.e., a sequence of x-admissible functions Qm for which 
W*[Qm] has as limit wo. We now apply Lemma 9. 3 with xm = «x+—x«; it yields 
the existence of a subsequence Q, and a «-admissible function *? Q = Q. for 
which (cf. (9. 08) ) 

W*(Q] Slim W*[Q.]. 


Since the right member here is the g.1. b. w* of W*, the equality must hold. 
Hence Q solves the minimum problem M*. This proves Theorem 9. 1. 

We add the remark that the minima w* of W* have the common upper 
bound zero, i. e., 
(9. 09) ow = 0, 


which follows immediately from W*[0] 0 since Q=0 is an admissible 
function. 
Before proving Theorem 9.2 we establish two lemmas: 


87Tf we had required the existence of continuous second derivatives for admissi- 
bility it would have been necessary at this point to prove that the limit function Q 
has this property—otherwise Q could not be identified as the solution of Mx. Our pro- 
cedure, which requires only L?-integrable derivatives for admissibility, makes it possible 
to separate the problems of existence and continuous differentiability of the solution 
of Mx, 


= 
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Lemma 9.4. A 0-admissible function Q(B) is also x-admissible for 
every x > 0. 

This is an immediate consequence of p* = p° = 1 and 
(9. 10) S 
in view of definitions (7.07) to (7.10). 


LemMaA 9.5. For any 0-admissible function Q(B): 


(9.11) > W°[Q] as «> 0. 
Proof. From (7.08) it is obvious that 
(9. 12) D°[Q] as 0. 


We apply Lemma B to 
(1—e)ap, 


observing that 


can be made arbitrarily small by proper choice of b. It follows that 


(9. 13) H*(Q] — H°[Q] as 0. 
The latter implies, in view of (7.10), that 
(9. 14) Ps*[Q] > as 0. 
We now consider 


The right member, in view of (9.10), has a positive integrand and 


can be made small. This fact and (9.14) permit us to apply Lemma B to (9.15) 
with the result 
(9. 16) K*[Q] > K°[Q] as «> 0. 


Relations (9.12), (9.13), and (9.16) lead immediately to (9.11). 

We now take any sequence of positive values x tending to zero and solu- 
tions Q* of the corresponding minimum problems M* (which exist according 
to Theorem 9.1). We may assume Q*(0) =0. As remarked above (cf. 
(9.09) ), the values W*[Q*] = * have an upper bound. We can therefore apply 
Lemma 9. 3 to the sequence Q* with x«—0. The lemma insures the existence 
of a subsequence converging in the sense of the lemma to a 0-admissible limit 


— 
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function Q . From now on Q* refers to such a subsequence. From (9.08) 
we have, in particular, 
(9. 17) lim W*[Q*] = W°[Qo]. 

We proceed to show that Q, solves the minimum problem M°. This 
minimum problem, according to Theorem 9.1, has a solution Q° for which 


(Lemma 9. 5) 


(9. 18) lim W*[Q°] = W°[Q°]. 

As a consequence of the minimum properties of Q@° and Q* we have 
(9.19) = = WwW [Q] and 

(9. 20) of == W*(Q*] = W*(Q°]. 


Here we have made use of the fact that Q° is x-admissible (Lemma 9. 4). 
From (9.20) we find 


(9. 21) lim W*[Q*] Slim W*[Q°]. 

Successive consideration of (9.21), (9.18), (9.19), and (9.17) yields 

(9. 22 Tim W*[Q*] W°[Q°] W°[Q.] Slim 

and this obviously implies the equality 

(9. 23) W°[Q°] = W°[ Qo]. 

Since W°[Q°] is the g.1.b. of W° it follows that the function Q = Q,p is a 


solution of the minimum problem M* for x =—0 with Q(0) = 0. 


From Theorem 8.1 we know that the minimum problem M° has at most 
one solution Q with Q(0) =0. Therefore Q°= Qo, i.e., all convergent 
sequences (* converge to the same limit function. If a sequence has the 
property that every subsequence contains a convergent, subsequence with limit 
LI and if Z is the same for all such convergent subsequences, then the original 
sequence itself converges to L. Therefore we conclude in our case that the 
solutions Q*(8) with Q*(0) = 0 of the minimum problems M* converge, as 
x— > 0, to the unique solution Q°(8) with Q°(0) = 0 of the minimum problem 
M°. This completes the proof of Theorem 9. 2. 


10. Limit state in the interior of plate. While the limit procedure of 
sec. 9 concerns the boundary layer, we deal in this section with the limit pro- 
cedure in the intertor of the plate as A—> 0. These two limit procedures have 
already been discussed and contrasted with each other in sec. 4. 

We summarize the discussion of sec. 4 concerning the interior limit 
process, but we use the notation of sec. 5. By multiplying (5.10) by «* and 
(5.12), (5.13) by «? («x =A") we obtain 


(10.01) + (xp) (xq) =0, 


i 
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(10. 02) G(x?p) = (xq)?/2, and 
(10. 03) for r—1. 


When «— 0, the following equations (cf. 4.01) result 


(10. 04) ig = 0, 

(10. 05) G(p) = 9/2, and 

(10. 06) p=1 for r=—1, where 
(10. 07) p=lim (x*p), g=lim (xq) for 


One expects that the limit equations (10.04) and (10.05) will be satisfied. 
The only admissible solution of them is p= const., Y=0. It was pointed 
out in sec. 4 that the proper constant value for ~ is not to be taken from 
(10.06) but rather from the asymptotic solution of the boundary layer 
problem, i.e., Pp—=P(«). Since P(co) is negative, this means that the 
radial membrane stress tends to a negative constant in every interior region. 


Precisely, we prove the following 
THEOREM 10.1. As «—>0 the limit relations 
(10.08) «?p,(r) > P°(), (10. 09) > 0 


hold uniformly in every interval OS rSro <1. Here qy(r) is the solution of 
the minimum problem M) (cf. sec. 5) and py is given by (5.07) and (5. 06). 


Once Theorem 10.1 has been established, it is clear that the limit pro- 
cedure discussed above leads to correct results for those solutions of B, which 
do not vanish (cf. Theorem 8.3). 

With the notation of sec. 7 the convergence relations (10. 08) and (10. 09) 


become 


(10.10) P*(«£) P®(), (10.11) «*Q*(«*f) 0; 
where 
(10. 12) 


Theorem 10.1 is evidently proved once these relations are shown to hold 
uniformly in every interval 0 << =. @Q*() is the solution of the minimum 
problem M*. In what follows the superscript « on the quantities P and Q is 
omitted where no confusion could result. 

We make use of various properties of the functions Q and P. From 
Theorem 8. 2 we have 
(10. 13) = 0. 


We show next that there exists a value By of B such that 
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(10. 14) P°(Bo) < 0. 


This can be seen indirectly as follows. If P°(8) = 0 for all B, (10.13) and 
(7.12) for x =0 show that Q°sg would be = 0. Consequently the function 
Q°(8) s£0 would be non-decreasing or would become negative infinite. But 
both cases are impossible in view of the admissibility condition 


f, (@)*dB < @. 

Since P*(B)) and converge to P°(B.) and Q°(Bo) as 
(Theorem 9.2) there exist positive values xo, c, c,, and d such that for x S ko 
(10. 15) and 
(10. 16) Q*(Bo) 


hold. From now on we assume always x = x, and B= Bp. 
Our subsequent discussion is based on the following formulas, which hold 
for all xk =0: 


(10.18) P(B,) —P(B.) = 4 (8. Pal Bs), 
(10.19) —Q9(8) (—P)dprdf, and 
(10.20) Q(B) — 


where the bar indicates that 8 has been replaced by B. 

Formula (10.17) is the definition of Pg (sec. (7.10)). Formula (10.18) 
is obtained by integrating (10.17) from B, to B. and applying integration 
by parts to the right-hand side. Formula (10.19) results from the differen- 
tial equation (7.12) by integration from B to o and use of the fact that 
Qs(B) > 0 if B—> o at least on a special sequence, the latter following from 
the admissibility condition D*[Q] < © (cf. (7.08)). Formula (10. 20) 
follows from (10.19) in the same way as (10.18) was obtained from (10.17). 

From (10.17) we have 


(10. 21) — Pg=0; from (10.18) and (10. 21) 
(10. 22) for 
and, in addition, in view of (10.15), 


(10. 23) —P(B) =e> 60, B = Bo. 
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Correspondingly we obtain from (10.19), (10.20), and (10.16), in view 
of (10. 23) and (10.13): 


(10. 24) Qs = 0, 
(10. 25) Q(B:1) = Q(B.) for and 
(10. 26) Q(8)=d, B=Bo. 


In view of (10.23), (10.25), and (10.24) we decrease the right member 
of (10.20) when we replace — P by c, @ by Q(z) and omit the second term. 
We have thus 


— QBs) = 00 (Be) 


The left member of this inequality, in view of (10. 26) and (10.13), is Sd. 
Hence we obtain 


Be. 
(10. 27) Q (Bz) (8 — B:)p*dp]*. 


For x = 0, (p° =1), (10.27) yields, with B: = Bo, Bo = B, 
(10. 28) 2de-*(B — 


For x > 0 we estimate the.bracket on the right side of (10. 27). We introduce 
the function o(£:, 82) by the formula 


. - 
(10. 29) f (8 


from which, in view of p*(8) = p'(xB) (cf. (7.06)), we have 


(10. 30) f, (2 — Bi) "dB (xB1, xB2). 


We insert (10. 30) in (10. 27) with 8, = Bo, replace Bz by x (cf. (10.12)), 
and obtain for x = £/B 
(10. 31) < de-*/o(xBo, 
We have already restricted { to the interval 0 < £ < @, and, if we restrict x 
by S/28 (as we may do), we have o(xBo, £) = o(Lo/2, Hence we 
derive from (10.31) the inequality 
(10. 32) S fo). 
The left member is bounded as « tends to zero for every £) > 0. This obviously 
proves the convergence relation (10.11). 

We turn now to the proof of the relation (10.10). An immediate 
consequence of (10.26) and (10. 23) is 


Q? < de(— PQ): 


aT 

t 

| 
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insertion of this in (10.18) after replacing Pg in accordance with (10.17), 
and comparison with (10. 20) and (10.19) yields for B. > B; 
(10. 33) P(B:) — P(B2) < $de*[Q(B:) — Q(B2)]. 
The right member is = $dc*@Q(f,) in view of (10.25). Hence, when B2.— ~, 
—P(.) tends to a finite limit —P(«) since it increases monotonically 
(cf. (10. 22)). 

We consider the inequality 


| — S| — P*(B:)| + | P*(B:) — P?(A:)| 
| P°(B:) — 


The first term on the right is to be estimated by (10. 33) and 
| — S| + | — + | 


The resulting inequality is 


(10.34) | P*(x*£) — P°(c)| < 4de | 
$dc™ | Q*(B1) — Q°(B1) | + $de | Q°(B1)| + | P*(B1) —P°(B:)| 
+ | P°(B:) —P°(o)|. 


Since > and 0 as Bi: (cf. (10. 28)) we can 
choose 8, so large that the third and fifth terms on the right side of (10. 34) 
are arbitrarily small. Since P“(8,) P°(B,) and Q*(B:) > @Q°(B:) (Theorem 
9.2), and since Q*(«x"{) > 0 (cf. (10.32)) as x0, we can choose x so 
small that the remaining terms on the right side of (10.34) become arbi- 
trarily small. This establishes the convergence relation (10.10) and the 
proof of Theorem 10.1 is completed. 


APPENDIX. 


Perturbation method and E. Schmidt’s bifurcation theory. The per- 
turbation method of sec. 2 finds its theoretical justification in its close relation 
to the bifurcation theory developed by E. Schmidt for a class of non-linear 
integral equations (cf. [14] and the literature given there). In this appendix 
we shall reduce our problem to an integral equation of a similar type. A certain 
singularity occurring in our problem does not interfere with the applicability 
of E. Schmidt’s procedure, of which we give a brief indication. 

We begin by considering the linear boundary value problem 


(B); gDq< 0, 


where the operator G and the forms D and H are defined in sec. 5 and 
f=f(r) is any given function. The corresponding homogeneous problem 
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(f= 0) is denoted by (B)o; we assume A,” to be the lowest eigenvalue of (B)o 


(cf. sec. 5). 
Our non-linear boundary value problem can also be written in the form 


(B)+ by taking for f the functional 


(I. 01) 

where the functional O is defined by 
1 

r 0 

while p» is given by 

(I. 03) p= = p—p° 


and represents the excess of the applied load p over the critical load p®. The 
problem thus defined will be referred to as (B) from now on. That (B) is 
our original problem can be seen as follows: instead of (5.07) we have 
p — p= 0(q"), hence (1. 01), in view of (I. 03), is the same as f = (p—2o") 
and the differential equation in (B) becomes Gq + pq = 0, i.e., equation 
(5.10). 

We proceed to construct the solutions of the linear problem (B);. If Ao? 
were not an eigenvalue of (B)o, the problem (B)+ could be solved by means 
of the Green’s function; in our case, however, Ao” being an eigenvalue, an 
improper Green’s function must be used to represent the solutions. Let 
j(r) =ar J, (Aor) (cf. (2.3)) be the first eigenfunction of (B)>o where the 
constant a is determined by 


(I. 04) jHj =1. 


The improper Green’s function s(r, p) (cf. Courant-Hilbert [3] Bd. I, Kap. V, 
§ 14,2) is then characterized as the solution of the boundary value problem 
with f =—j(r)j(p), which is continuous but has for r= p the jump 


d 


We note that s(7,p) is symmetric in r and p: 


singularity 


S(r, p) =8(p,7). 
Such a function can be determined explicitly: 
s(r,p) —— (x/2a*) n(r)j(p) + 4Lm(r)j(p) + m(p)] 
+ 
for r= p, and, by symmetry, for 7p. Here j(r) is the function defined 
above, and = (Aor), = 2 (Aor), Where Y, is the Bessel 


= 
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function of first order and second kind and J. that of second order and first 
kind. The constant 6 is arbitrary and indicates that s(r, p) is determined only 
within a multiple of 7(r)j(p). 

With s(r, p) we construct the operation 


1 
Sh— f 
0 


for any function h(r) with hHh < «. The function Sh satisfies the dif- 


ferential equation 
(I. 05) (G+ =—h+j(jHh), 


as can be inferred from the above properties of the Green’s function s(r, p) ; 
in addition, g = Sh satisfies the other conditions of problem (B);. For a 
function f orthogonal to j, i.e., satisfying 


(I. 06) jHf =0, 


the function g = Sf is then a solution of (B);. Since such a solution of (B); 
is unique within a multiple of j, we obtain all solutions of (B); in the form 
(I. 07) q=«j)-+ Sf for constant «. 

Upon applying the operation S to h =j we obtain (G+ Ao”)Sj = 0, 
indicating that Sj is an eigenfunction of (B)» and hence that §7 is a multiple 
of j, say Sj—cj. After replacing s(r,p) by s(r,p) —cj(r)j(p) (which 
amounts to fixing the constant b in the explicit expression for s(r,p)), the 


relation 
(1. 08) = 0, 


follows from the definition of the operation S and (I. 04). We assume (I. 08) 
to hold from now on. An immediate consequence of (I. 08) and the symmetry 
of s(r, p) is 

(I. 09) jHSh = 0, 

indicating that the function Sh is orthogonal to 7 for arbitrary h. 

We now turn to problem (B) where f is given as a functional in g 
through (I. 01). Upon inserting f as given by (I. 01) into (I.07) we obtain 
(I. 10) q = <«j + — S{q0(q")}, 
an integral equation which must be satisfied by every solution q of (B). 
However, a function satisfying (I. 10) is a solution of (B) only if the con- 
dition (I. 06) is satisfied, which, upon using (I. 01), takes the form 

jHq — jH{q0 (q°)} = 0. 
This “ bifurcation equation” is to be considered as a relation between p» and «. 
By virtue of (1.07), (1.04), and (I. 09) it reduces to 
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(I. 11) pe — jH{qO(q?)} =0. 


As we are interested in solutions in the neighborhood of «0, q=0, 
it is appropriate to introduce a new function ¢ through 


(I. 12) gq =«j +t, where ¢, of course, depends on «. 


In view of (I. 06) equation (I. 10) becomes 


(I. 13) wSt—eS{(j + t)0(j+4)%), 
while the bifurcation equation (I. 11) assumes the form 
(I. 14) e[u—ejH{ (j + t)O(j + t)*}] —0. 


We exclude the trivial solution « = 0, t= =0 valid for any Then (I. 14) 
reduces to 


(I. 15) p—ejH{(j + 


Once the relations (I. 13) and (I. 15) have been obtained, the procedure 
of E. Schmidt can be applied: 1) e« and p» are considered as independent 
parameters and the integral equation (I.13) is solved ** on this basis; 
2) the solution ¢t(r;e,m) of (I.13) is inserted in the relation (I. 15) ; 
p» is then considered a function p(e*) of € through the resulting transcendental 
equation, the bifurcation equation; 3) inserted in (I. 12) 
determines finally the solution of problem (B). 

Following the procedure of E. Schmidt, it is possible to show that (I. 13) 
has a solution which is a power series in » and ¢* for small enough p» and e. 
The relation between p and « derived from (I. 15) will then be such that p 
can be expressed as a power series in ¢”, starting with the term e?7H70(j") 
which does not vanish.*® That is: » = p— p® is a power series in ¢? and q 
a series in odd powers of «. This was assumed in working with the perturbation 
method (sec. 2). 
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88 Note that O(q*?) and Sq are continuous even at r= 0 if q has this property. 
Hence iterations on (1.13) are applicable if ¢ is assumed to be continuous for 
OSrsl. 

89 From the definition of O(q?) we may write 


jHjO(j?) = (9°) 9? (12) 
0 


where 1, = max (1,12). 
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